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MEMORANDUM 


If the modern student, who aspires to be an Actuary, is faced with 
a wider range of actuarial subjects and a more searching test of his 
understanding of actuarial principles than was the student of an 
earlier generation, he has this compensation—that the means to 
acquire such understanding are more readily available and put into 
a form which may more readily be assimilated. 

This book is such an example of the help afforded to the actuarial 
student, and he is fortunate to have the opportunity to study a work 
of this merit at the very beginning of his Course. A complete grasp 
of its principles will not only assist him to satisfy his examiners as 
to his proficiency in the mathematical problems by which he will be 
tested, but when he has so satisfied them he will have laid such 
foundation for his future work that he will ever have cause to be 
grateful to the author of this treatise. 


H. J.P. O. 



NOTATION 


PfX^x) a polynomial (i.e., a rational integral function) of degree 
n in X, 


^ir) 




n(n~ i) (n — z) 1) _ 

r ir~ ifCr - 2) Z 2.2.1 ~ rl 

or negative, integral or fractional. 


j , where n may be positive 


This is represented in other works by the symbols or 
w,.; sec y.LA. vol. lxiii, p. 58. 
n(n—i)(n~2)...(n~r-i-i). 


There seems to be no recognized symbol for the more 
general factorial n (n ~h){n — zh) ~ ih). It may 

sometimes be convenient to represent this by the same 
symbol but in that case the symbol must be specially 
defined and consistently used. Cf. posty p. 19. 


(w + i) (w 4 - 2)... (n + r) + 

A different notation is employed in certain other works— 
SQQ post, p. 19. 


Mua the divided difference of order i. This notation is explained 
^ and reference is made to other notation|Htti Chapter III. 

Paragraphs and examples marked with an asjf^i^, thus are 
intended for the advanced student only, and need not be read by 
students preparing for Part I of the Institute Examinations. 
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AUTHOR’S PREFACE 

As explained in the Preface to Part I of this book, the revision of the 
syllabus for the Examinations of the Institute of Actuaries rendered 
it necessary to divide An Elementary Treatise on Actuarial Mathe¬ 
matics into two parts and to make additions to that part of the book 
intended for students who were reading for the first Part of the 
Examinations. The opportunity has been taken to revise thoroughly 
the Chapters on Finite Differences and Probability and to bring the 
book up to date. While little alteration has been made in a few 
Chapters of Actuarial Mathematics the majority have been entirely 
rewritten, and it is therefore advisable to refer to each Chapter of 
the present book in detail. 

Chapters I and II do not differ markedly from the first two Chap¬ 
ters on Finite Differences in Actuarial Mathematics. Chapter III, 
on Interpolation with Unequal Intervals, has been rewritten. The 
introduction of Dr Aitken’s notation has simplified the general pro¬ 
blem of divided differences, and as a result a more logical exposition 
of the theorems connected with these differences has been given. 
The earlier parts of Chapter IV, on Central Differences, have been 
remodelled, and much new matter inserted. With regard to Chap¬ 
ter V as given in Actuarial Mathematics it was thought that the 
treatment of the principles of Inverse Interpolation was to a certain 
extent incomplete. Dr Aitken suggested lines on which improve¬ 
ment might be made and the initial paragraphs have therefore been 
redrafted. Considerable alterations have been made in the Chapter 
on Summation^ tf^^r^ few paragraphs have been recast, a new 
section dealing wit^ the application of operators to the summation 
of algebraic series^ ba^ been introduced and paragraphs on the 
operator [n] have h^fi inserted. Chapter VII, on Miscellaneous 
Theorems, has been enlarged by the introduction of new material 
and a fuller treatment has been given of the matter in the previous 
book. Chapter VIII is entirely new. It was considered that there 
was now an opportunity to collect in one Chapter 'a number of 
modern methods and special devices which have recently appeared 
from the pens of Mr Lidstone and Dr Aitken, but which are scat¬ 
tered over the pages of various Journals. The Chapter on Approxi- 
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mate Integration has been amended in order to clarify certain proofs 
and theorems, but is otherwise unaltered. 

In view of the introduction of Elementary Statistics into the 
syllabus for Part I of the Examinations it was considered advisable 
to approach the study of Probability from a slightly different angle. 
As a consequence, Chapter X has been largely rewritten. New 
Chapters on Elementary Statistics have been introduced: these 
Chapters do not deal exhaustively with the subject, but are, it is 
hoped, sufficient to enable the student to solve simple types of pro¬ 
blem met with in practical work. Little alteration has been made 
in the Chapter dealing with Mean Value and the application of the 
Calculus to Probability. 

The Examples are mainly those which have already appeared in 
Actuarial Mathematics. Examples ii and 12 contain exercises in 
Statistics and the Miscellaneous Examples contain many new 
questions. 

In the preparation of this Part of the book I have received much 
help from various sources. I have had access to the notes written 
for the Actuarial Tuition Service and some of the Lessons on 
Probability and Statistics (prepared originally by P. M. Marples) 
have proved of value. My thanks are due to Mr H. Tetley and 
Mr O. C. J. Klagge for helpful suggestions on certain points. 

It is, how'ever, to Dr A. C. Aitken and Mr G. J. Lidstone to whom 
I am most indebted. Dr Aitken generously put at my disposal his 
notes on various subjects connected with Finite Differences. In 
particular, his suggestions on Inverse Interpolation, the divided 
difference notation and the methods of cross-means have been such 
that I was able to adopt them with little alteration. Throughout the 
preparation of the book, from the earliest stages to the final proof¬ 
reading, Mr Lidstone has been my constant guide and mentor. 
A great deal of the work has been inspired by him and his comments 
and criticisms on all points have been invaluable. He has read 
through and annotated the whole of the book with that thoroughness 
for which he is justly famous, and I am deeply grateful to him for 
his help. 

H. F. 

March 1939 
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CHAPTER I 

DEFINITIONS AND FUNDAMENTAL 
FORMULAE 

1 . The function y —a -f kx'^ is a rational integral 
function of the nth degree in Xy where the indices are positive 
integers, n being the greatest, and a^by c ... k are constants, of 
which A=#o but the others arc unrestricted. 

A rational integral function is also called a polynomial, and it 
is convenient to represent a polynomial of the nth degree as 
Pni^)- 

Consider the simple polynomial y — -\-x-hx^. It is quite 

easy to obtain the value of y corresponding to any value of x by 
substituting that value of x on the right-hand side of the equation. 
For example 

.V012345678 
I 3 7 13 21 31 43 57 73 

It will be found that for successive integral values of .t in the 
above table the values of y have interesting properties. If from 
each value of y the previous value of y be subtracted, we obtain a 
new set of figures: 

(a) 2 4 6 8 10 12 14 16 

and if the subtraction be performed on these figures in the same 
way the new differences are 

(^) 2 2 2 2 2 2 2 

The sequence of 2’s in (^) is not a mere coincidence: it will be 
shown later that when y has the value supposed all the terms in 
(jS) have the same value, 2, however far the series extends. 

This leads us to another method of obtaining values of y. Sup¬ 
pose that we write down the original table in a different form, and 
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include in the table 

X 

O 

1 

2 

3 

4 

5 

6 

7 

8 


two sets of figures 


y 

(«) 

I 

2 

3 

4 

7 

6 

13 

8 

21 

10 

31 

12 

43 

14 

57 

16 

73 



a) and (j 3 ) thus: 
(B) 

2 

2 

2 

2 

2 

2 


We can now find any further value ofy by extending the columns 
(a) and (jS). We must however work from (^8) to (a) and then to 
y instead of from to (a) and then to (^) as has already been done. 
For example, to obtain the value of y wlien x has the value 9, i.e. 
to obtain a new 2 must be inserted in the (j8) column: the new 
value in the (a) column will be 16 + 2= 18, and the required value 
of jv will be 73-h 18 = 91. To find z/jo the process is continued. Any 
value of y corresponding to an integral value of x can be obtained 
in a similar manner. 

The above is a particular instance of a far more general set of 
operations. We have used the simplest possible numerical values 
of X, namely the natural numbers, and we have evolved our 
example from a known quadratic function y^u^—i+x-yx"^. As 
a general rule the form of the function is not known and the given 
values of x are not necessarily consecutive integers. 

Now suppose that instead of numerical values of x differing by 
unity we have the following consecutive values of x\ 

a, a + a + 2h, a + •••> 
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where the values of x differ by a quantity h instead of by 
unity, 

'Fhen if the function be still j = the values of y corresponding 
to the above values of x will be 

In order to form a column similar to column (a) above we shall 
have to write down 


^a+h ^a+2h ^a+2fn 

These are called the first differences of the function y — Uj, and 


are denoted by 




where A is not a quantity but a symbol representing an ‘‘operation 
Column (^), being the differences of column (a), will be 

iMa \ 2h \ /<) i h 
(^(M 3 A ^«+2 a) i 2 h 


or, more shortly, 


f A> 


These are called the second differences of and are denoted by 

I hy \2h * * *> 

where, it must be emphasized, the symbol A'^ does not represent the 
square of a quantity but denotes the repetition of the operation A. 

Similarly, third, fourth, ... wth differences, formed in exactly the 
same way, are denoted by 

... A«m„. 

4 . Before forming a difference table similar to that in paragraph i, 
it is convenient to introduce alternative names for a; and y in our 
equation y == Where our ultimate object is to obtain numerical 
values of x or jy, the independent variable is often termed the 
argument^ and the corresponding value of y the entry. 


1-2 
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In a table of logarithms the number itself is the argument and the 
logarithm the entry. The converse holds in a table of antilogarithms, 
where the logarithm is the argument. Similarly in a table of sin a, a is 
the argument and the sine the entry, whereas a is the entry in a table of 
sin-^ a. 


5 . Our new difference table is therefore 




First 

Second 

Third 

Argument 

Entry 

differences 

differences 

differences 

a 





a + h 








A 3 «„ 

a + zh 

^a+ 2 h 






2 h 



a + ^h 



^ ^a+ 2 h 






AX+2A 


^a-f-4/i 


AX+ 3 A 


a -f* 










The first term in the table (w^) is called the leading term, and the 
differences which stand at the head of the respective columns, 
namely Aw^, are called the leading differences. 


6 . Although we have expressed the terms in the difference table 
by the use of A symbols, it is quite easy to obtain any difference 
in terms of the function alone. 

For example, is the difference between and 

or = 

Again, is the difference between Am„+,, and Am„, or 

AX = AMa+A-AM„, 

and as Am„ = - m„, 

we have A®m„ = — A^u„ 

= (^««4 2/i - - (^«a+A “ A«„) 

~ {^a+3h ~~ ^a+2h) ~ ^ {^a+2h ~ ^a+h) d" {^a+h ~ 

“ ^a+3h ~ 3*^a+2A "h 2^a+h ~ 
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TABLE 

5 

7. It is a simple matter to construct 

a difference table from a given 

set of data. 





Consider the following examples: 



Example !• 





Construct a 

difference table from the following values, where y is a 

function of x : 





X 

y 

Ay 

A‘V 

A='y 

I 

I 

7 



2 

8 

19 

12 

6 

3 

27 

37 

18 

6 

4 

64 

61 

24 

6 

5 

125 

91 

30 

6 

6 

216 

127 

36 


7 

343 




Example 2. 





Show that, in the following table of annuitv-values, third differences 

are practically constant: 




Argument x 

Entry 

Ac/, 


A^c/, 

35 

14-298 

“•^54 



36 

14-144 

-•158 

— -004 

+ -OOI 

37 

13-986 

— -161 

-'OO3 

•000 

38 

13-825 

— *164 

-’OO3 

+ -OOI 

39 

13-661 

— •166 

-002 

+ •001 

40 

13-495 

— -167 

— -OOI 


41 

13-328 





It will be observed that in Ex. i third differences are invariably the 
same. In the second example, however, third differences are not quite 
constant, although the error in assuming them to be so is very small. 
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The difference in the two examples lies in the fact that, while the 
first function is y — the table of annuity-values from which the data 
in the second example have been taken does not conform to a simple 
mathematical law and, further, the values do not naturally terminate 
with the third decimal place, but are rounded off at that place. 


Example 3. 

Assuming third differences constant, find the values of i/g and from 
the data: 

^456 7 8 

Ux *35 1*71 2-90 4-51 


Construct the difference table from the given values, and fill in the 
vacant spaces in the column with the constant third difference, 
thus: 


X 


Au ^ 



2 

- 05 

•11 



3 

4 - 06 

•29 

CO 

•06 

4 

+ *35 

•53 

•24 

•06 

5 

-f «88 

•83 

•30 

•06 

6 

1*71 

1-19 

•36 

•06 

7 

+ 2-90 

i-6i 

•42 


8 

+ 4-51 





8. Now it has been stated above that a convenient method for 
expressing the difference between two successive values of a 
function and is by the symbol A prefixed to u^y so that 
It will be seen therefore that to find Aw^ we per¬ 
form two operations: we change to and subtract from 
it. The new function resulting from the first of these opera¬ 
tions is denoted symbolically by and the double operation 
may be written ^u=Eu.-u.. 


This gives 


Eu„ = -f Am„ 


Eu^ may therefore otherwise be expressed as the sum of and 
its first difference. 
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7 

Suppose that in either of the above relations the which occurs 
in each of the terms be omitted. Then we can state that the two 
operations denoted by and “A’' are connected by the 

symbolic identity — i + A 

It must be distinctly understood that we have not “factorized 
out” in the relation and that we must relate the 

symbols to the functions on which they operate. If, therefore, we 
were using the equivalence A ^ /s — i, and we operated on the 
function sin x, it would be wrong to say that A sin x~E sin .x:— i. 
The correct statement is A sin v = A’siii^c —sin x:. When we are 
dealing with symbols of operation we cannot treat any of them as 
quantities, and on forming the algebraic or trigonometrical identity 
the function must be included in all three terms. In other words, 
in the identity is = i+A the i is a symbol of operation just 
as are E and A, and its meaning is that the function on which it 
operates is to be taken once without alteration. 

9 . In the same way as A^ denotes, when operating on a function, 
the difference of the difference of the function, i.c. the second 
difference, so den tes the operation of repeating E. That is to 

= E.Eu^ = Eu^^,, = 

and, generally, = 

Care must be taken not to confuse the expression E'^Uj, with 
(Eu^)^, For example, 

£•2 — + zlif — + d^hx -f 

but {ExY = (:v -f Kf ~x^-{- 2hx + 

10 . It is evident that the first difference of a function of the form 
cXy where c is a constant, is constant: for ^cx = c (x + h) — cx = chy 
which is constant. 

Let us consider the effect of differencing a function of x of 
higher degree than the first. 
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Example 4. 

Difference successively the functions (i) y = hx^ and (ii) J 

(i) bJ)x^ = ^ (jc + — hx‘^ = ^hhx 4- 

b^hx"^ = A {zhhx + hhr) — zbh (a: + /r) + bh^ — zbhx — bh^ = zhh'^y 
and since zbh^ is constant all higher differences will be zero. 

(ii) b.ax^ — a{x-\-hY — ax^ = 2t^hx‘^ + + ah^y 

bi^ax^ = 6ah^x + 6ah'^y 

and bi^ax^ — 6ah^y higher differences vanishing. 

Collating the above results, we have that 

the first differences of functions of the form cx are constant, 
the second „ „ ,, bx^ ,, 

the third ,, „ „ ax^ ,, 

It follows therefore that third differences of ax^ + bx'^ + cx-i-d 
iare constant, for before we reach the third differences the terms 
bx^y cx and d will have been eliminated. 

11. The above considerations lead us to the following important 
proposition: 

If Uy, be a polynomial of the wth degree in Xy then the wth 
difference of the function is constant. 

Let the function be 

Uj. — ax'^ + bx^^~^ + ex’* -h... + 5; 
then Aw^ — a{x-\- hY + b {x + hY'^ c (x f 

— flx’* — — cx^~^ —... — 5 

= anx^^~ 4- b'x^^^ 4- e V*^"^ -f... 4- r', 

where b', c\ ... r’ are coefficients involving h but not x. 

Similarly, 

= an{n—i) + b”x^~^ 4- c'x^~^ +... + g'", 

and so on. 

Each time that we difference we lower the degree of the function 
by unity. After differencing n times no terms after the first will 
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appear, and we shall be left with 

{n-i){n- 2 ) (w-3) ... 2.1 or an\h^, 

which is independent of x and is therefore constant. 

As a corollary we may note that = a property of a 

polynomial of the nth degree which is of value in the practical 
application of the work. 

The converse proposition is of importance: if the (n + i)th differ¬ 
ence of a function is the first to become zero, the function is a 
polynomial of not more than the nth degree. 

12 . It should be remembered that we are dealing here with a 
particular form of function. Should the function be other than 
a polynomial the nth difference will not vanish however great n 
may be. Thus, we have 

Example 5, 

Find the nth difference of e^. 

— e® = e® (e^ — I), 

A‘^«® = — I) — e^) = e® (e^ - i )^. 

Similarly, A^^® = (e^ — i 


Generally, A”e® = e® which is still a function of Xy and is 

therefore not constant. 

r 3 . Although it has been said that the symbols A and E are in no 
sense algebraic quantities, our definitions, namely that A” denotes 
the operation of differencing the function n times, and that 
denotes the operation of obtaining a new function when the argu¬ 
ment is increased by n unit differences, enable us to apply to these 
symbols the ordinary algebraic laws. For example, 

A + Uy) — + ^y + h ^X+h "h Uy_I^J^ — Uyy 

which is AZ/3. +Awj,. This relation is exactly similar to the ordinary 
algebraic identity 3 (jc+jv) = 3^ + 3J'. 
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The three simple algebraic laws are the laws of (i) distribution, 
(ii) commutation, (iii) indices. 

O') (Ux + ^x + ^x+-) 

= («X t A - - ^x) + (a^x+A - a’x) 

= Au^ -f Av^ + Aw^ H-.... 

Similarly, 

E (u^ + v^ + w^ + ,..)=^Euj^-^Ev^, + Ezv^ +.... 

(ii) The symbols A and E are commutative in their operation as 
regards constants. For if c be a constant, 

Acu^ = ft - Cll^ == C (Mx-, ft - Mx) = <^A«x. 

and EcUj^ — = cEti^. 

^ (iii) The application of indices to the symbols A and E may be 
shown thus: 

If m be a positive integer, then A^^ represents the operation of 
differencing m times. 

A^''Wp = (AAAA ... m times) w^, 

(A'"'w^) = (AAAA ...n times) (AAAA ... m times) 

— (AAAA ... 7w + w times) 

Similarly, FX = ax+mft- 

= = ax+mft+r,ft = -^’'*+"ax- 

/ 14 . In connection with the law of indices we must be careful to 
define A^^\ A”, £'^, ... when m and n are not positive integers. So 
far, the symbols A’” and are intelligible only when we can 
actually perform the operations defined above and obtain the values 
of the new functions. We have not yet defined these symbols when 
the indices are negative. Consider for example the symbol A”^. 
Since we have assumed that the symbol A obeys the ordinary 
algebraic laws, A~^ must be such that A (A~^mJ gives A^w^., i.e. 

Let w be a positive integer. Then we define A"^u^ as a function 
such that if it be operated on by A’” the result will be 
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i.e. u^. In the same way we have a meaning for namely, 

that operating on produces u^. But if m be a positive 

integer, E^ operating on produces I'herefore 

the same result is obtained by operating with on E~'^^u^ as on 
^x-rnh • In other words just as E^^u^ gives ^ so E ~^^u^ gives . 

The symbols E and A may be manipulated in a manner similar 
to algebraic quantities provided that it is always remembered that 
they are operators and that they have no actual values. There arc, 
however, two important points in which algebraic precedent cannot 
be safely followed. These are: 

(1) Operators arc not commutative with regard to variables. 

E.g., A does not as a rule equal 

(2) It is fundamental in algebra that if a function vanishes, then 
one of its factors must vanish. It is not true that if the result of a 
series of operations on is equivalent to o.u^ (i.e. zero), then 
some one of the operations on must produce o. For example, 
if ^^ = 0, then x — o; it does not necessarily follow, however, that 
if A 2 = o, then A = o. 

In many problems it is convenient to use operators alone and to 
omit the functions on which they operate. Where this practice is 
followed the sign = (is equivalent to) should be adopted in place 
of = (equals). Thus, Eu^ — {i -f A) but £ = (i + A). 

For further information on the difficulties connected with the use of 
operators the student may refer to J.S.S, vol. ii, pp. 237 et seq, (S. H. 
Alison). 

^ 15 . Proceeding from the definition of differencing, it has been 
shown that 

2 h ~ T ^^x-^h 
= m^ + Aw^+A (w^ + AwJ 
-«^ + 2Aw^ + AV 

^x^$h ~ ^x+ 2 h T ^^x+ 2 h 

= + AX + ^ K + 2Aw^ -f A^WJ 

= + 3 Aw^ + 3 AX + 
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The coefficients of the various terms in these expansions are the 
coefficients of x in the expansions of (i+x), (i+^)® by 

the binomial theorem. If we assume, for positive integral values of 
n, that the general relation between and and its differences 
follows the same law, we can prove the truth of the assumption by 
the method of mathematical induction. 

Assume therefore that 

+ ^(l)^^x -^^( 2 )^% + ... + W(,) + ... + AX 

is true for the value «. 

Then, since «a:+(„+i)A = “x+«A+^«a:+«A. 
we have 

^x+(n-i-m = + ”(1) ^^x + 4-... + w (,.) A^w^ +... -f 

+ A X + Wo )^^X + «(2)+ • • • + W(,)AX +... + Ax) 
= -f Aw^ (wo)-f I) + AX («(2) + wo)) +... 

+ AX {fhr) + riir-i)) +... + 

But W(,) + W(,_i) = (w+i)(,), 

therefore 

Wx+(n+l);. = ^:r + (w 4 -l)a)Aw^ + (w 4 -l){ 2 )AX+--*+(^+l)(r)AX 

+ ...+A^+’m^, 

which is of the same form in (w4-1) as was the original expression 
in w. 

Therefore if the assumption is true for n it is true for w +1. 

But the theorem holds when w= i, 2, 3. 

Therefore it is true when w = 4, 5, ... and for all positive integral 
values. 

Therefore, for positive integral values of w, 

Ux+nh = Kx + + «(2) + «(3) + ... + n(,.)A’-M, +... + AX- 

r 

16 . When the relation between the operators A and E was dis¬ 
cussed it was stated that our definition of these operations enables 
us to apply the ordinary algebraic laws to these symbols. We may 
therefore use the equivalent relation 

i's I “h A, 
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and if we operate on the function we shall have 

' =(i+«(i)A-f«(2)A2+...+«(,)AM-...+A^) 

If we introduce the fact that the symbols follow the algebraic 
distributive law, we may write 

^x+nh + + «(2) AX -f... + W(,) AX +... + A’» 

which is the relation proved above for positive integral values of n. 

This result is true whatever the form of the function so long as 
w is a positive integer. If n be other than a positive integer we 
cannot adopt the binomial expansion without further investigation. 
For the purposes of this chapter it will be sufficient to assume that 
the relation £'” = (1 +A)"~ i -fW(j)A-f W(2)A2 + /7(3)A^4-... holds 
without restriction. The conditions of the validity of the expansion 
will be discussed at a later stage (sec Chap. II) 

>< 17 . We are now in a position to state that if n-hi consecutive 
values of a polynomial of the wth degree are given, then, by the 
method of finite differences, we can obtain the actual function in 
the form 

«^ = M„ + X(1) A«o + Xfe) AX+•••+■>?(„) 

or = +J 5 .V(i) + CjC(2)+...+ 

where the coefficients A, K are obtained by inspection of 

a table of differences. 

Now if we are given « +1 corresponding values of x and it 
does not immediately follow that Uj. is a polynomial of the «th 
degree. 

For example, suppose that we have the following data: 

X? o I 2 3 4 5 

I 4 9 16 25 36 

Since six values are given there are the following possibilities: 

(i) they may be actually given as values of the function (i +a:)2 ; 

(ii) they may be given as values of a polynomial of the second 
degree in x, and it may be required to find the function; 
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(iii) they may be given as values of a polynomial of the nth 

degree^ where n is less than 6, and it may be required to find the 

function; 

(iv) they may be given as values of a polynomial without any 
information as to degree; 

(v) no information regarding the nature of the function may be 
available. 

The answer to (ii) and (iii) is obviously u^ — {i 

The answer to (iv) is = + x (x—i) ... or 

(i where is a polynomial in x which does not 

become infinite at any of the points o, i, 2, 3, 4, 5. The function 

... ix-s)F^ 

will then obviously vanish for these values. 

The answer to (v) is the same as to (iv) except that F^. need not 
be a polynomial. 

It is of importance to realize that we can always find a value for 
Fj. which will make the function = (1 -f jc)^ + X(e)Fx ^gree with any 

additional value whatever. For example, if ^4.5“ 19*75 the function 
'f-x;)2 4-.V(Q)2^ will agree with the given values and also with 
the additional value which has been inserted at the point x = 4*5. 

Conversely we can say that whatever be the complete form of 
the function of which the six given values are samples, the value 
at any other point is the value of the function (i +a:)‘^ at that point 
with an error X(^^^)FJ.. Whether the value is a good approximation 
or not depends on the magnitude of and we may or may not 
have reason to suppose that F^ is so small that it can be neglected. 
It should be understood that we are not at liberty to say that 
(i +^)^ gives an approximate value at the point in question unless 
we can give such a reason, based either on theory or on experience. 

The matter is further investigated in later paragraphs, but it may be 
said that in most practical cases F^ is of the same order of magnitude 
as for some value of x in the range under consideration. It may 
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in fact be shown that Fj.— , / , where f is a quantity fallinp; in the 

range which includes x and the given values of u. (See Chap. Ill, 
paragraph 17,) 


" 18 . If instead of writing = and expanding this by the 

binomial theorem, we write = and expand, a new series 

is obtained: 

A^u^ = (E-iy'u^ 

= [E'^^ -~ i)'' n(,)E^'-^ 

" ^x+nh~' ^ (2)^x+in-2) h “ ••• 

+ (— l)^ ^hr)^^jrh(Ai-r)/4 + ••• +(— 

Just as the relation established in paragraph 15 enables us to 
obtain the value of in terms of and its leading differences, 
so the above relation gives any required difference of the function 
in terms of successive values of the function. 


19 . A few' simple illustrations of the use of these two formulae are 
given below. 

Example 6. 

Find 2/g, given 7/^= —3, 7q = 6, 7/2 = //., = 12; third differences 

being constant. 

The leading differences are easily found to be Au^^ = g\ A^w^= - 7; 

AX = 9 - 

Uq-{i -f A)*^ + 6A4-15 a®- f 2 oA^) u^^ 

~ 7 /^) + 6 A 7 ^q + 15 A“ 7 /o -f 2 oA^ 7 /j, 

= — 3+ 54— 105 + 180= 126. 

Note. There is no need to continue the expansion beyond third differ¬ 
ences, as further differences are zero. 

Example 7. 

Find «2, given 7/4 = o, = 3, Mq = 9; second differences being constanl. 
Here the initial term of the known series is 7/4, so that in order to find 
W2 we must use the relation 

+A)-- W4 = (i ~2A + 3 A2) W4, 

as far as second differences. 
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U2 — U^ — 2AW4 + 3 

= 0-64-9 = 3, 

since A«4 = 3 and A2^/i = 3. 

Example 8. 

From the following values of u^, calculate A^UqI 

^ 0 ~ 3 y ^1=12, 2/2 = ^3 = 200, ^4=100, ^6 = 8. 

Since we require only one value of A^m^., we do not need to form a 
difference table, but may write at once 

= Uq 

= (F/^ + loE^ - 4 - sit - I) uo 

= E^Uq - sE^Uq 4- ioE\^ - ioE'% 4- 5/fWo - «o 
W5 - 5W4 4-10^3 - 10W2 4-5^1 ~ Wo 
= 755- 

Note. Before we can find the fifth difference six terms of the series 
must be given. 

^ 20. Separation of symbols. 

In obtaining the value of in terms of and its differences 
we have used the symbolic relation if — i 4- A and have expanded 
(i 4- A)^ by the binomial theorem without introducing the function 
ii^ until the last stage. This method, in which in fact is omitted 
from both sides of the identity, is known as the method of separation 
of symbols^ and enables many relations involving and differences 
of to be readily established. It must how^ever be remembered 
that the operators cannot really stand alone and that the operand 
iij^ is always understood. 

Example 9. 

Show that 

Wo4-zq4-//2 4-...4-ttn 

= («4' i)(i)Wq4-(w4- i)(2)AWo 4-(/^ 4- i){3)A®ttj)4“... 4 -A’’Wq, 
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Wo H- % + «2 4*... 4- 

- Wo 4- Euq + E^Uq + ... 4- £'”Wo 
= (i 4-^4-^2 + ...4-/i”) Wo 

_j 

= - UQy or substituting i -f A for 

_(i+Ar+^-i 
~ A ^ 

” A 4-(w 4-i)(i)A+ (w 4- i)(2)A2 4-(w4- i)( 3) A-'^4-... 4-A"'+^— i] 

= [(f^ + I)(i) 4- (w 4- l)(2) A + (w 4- l)(3) A“ 4-... 4- A"] Wo 
= (w 4-1)(i) Wo 4- {n 4-1 )(2) Awo 4- (w + I)(3) A^Wy 4-... 4- A^'^Wo- 

Example 10. 

Prove by the method of separation of symbols that 

^^x—i 4" Aw^_ 2 4” A^w,.„3 4* A W2 p_ 4 4“... 4‘ A^*^ 4* A^w^j^,^. 

( /A\’^) A’”— A'" 

W^ A^ ~ A^* A ” Wjp = |l I" ~ y,’~n ~ 

= £’-n (y/;, ~1 ^ ^J.^n-2 + + ... 4- A«-l) W^ 

= (£:-14- AE-‘^ 4- 4-... 4- A- 

= 4- Aw.^„2 4- A2w^_3 4- ... 4- 

• * ~ 4" AUj,_2 4- A“Wj.._3 4" ... 4" A'''‘' 4" A^Zly.^, 

Since this is true for all valuer of n we have the convenient formulae 
u^ — Uj^_i + Aurj (which is otherwise evident), 

= ^x~i 4- Aw^_2 4- A‘'^w^_2, 

4- A«a,_2 4- A2 w^_3 4- A^^w^_3, 

and so on. 

Example il. 

Obtain a formula based on w„ similar to that given by the relation 

= 4-A)* Wo. 
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n—x 



n—x 


since is — A 


I, 


[i - (w - ^)(^) Ais'-i + (/? - a:)(o) - ...] ; 

• • ~ “ ^)(1) + (w — '^’)(2) “ • • • • 

It will be found that this is an ordinary formula which could be 
obtained by using the values in the reverse order ^o* 

There is as much justification for using one order as the other. It 
should be noticed that the same numerical values appear in the table of 
differences, but that they are in the reverse order with a change of sign 
for the odd differences. This should be tested by a numerical example. 

If ^ > w, we may use the general relation 

(«-»•)((, = (-i)‘(x-w+Z-i),(, 
and write the formula as 


«» = «» + (•»- n)a) A«„. 1 + (:k - « + I )( 2 ) +..., 

where the coefficients are positive and are those in the expansion of 


Example 12. 

Find the value of 

+ A*x”'+... to m terms. 

“ 2.4 2.4.6 

Since and higher differences of are zero, the sum of the 

series to tn terms is the same as the sum to infinity. 

Omitting the function x^y and working on symbols alone, we have 

A-iA-+-- A‘^4-...~A (i-.^A + i™ A^h-.. 

2.4 2 . 4.6 \ “ 2.4 2 . 4.6 J 

=A ^i-|A+htA2-fj^'A3 + ...j 

=A{i+A)-i=AE-i. 

The value of the given series is therefore 

AE'ix^'^ = A{x-lY = {x + lY-{x-1Y, 
if the interval of differencing be taken as unity. 
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Further examples of the application of the method of separation 
of symbols to the operators A and E and to other operators will be 
found in Chapters VI and VII. 


< 21. Factorial notation. 

For many purposes it is useful to use a notation for the product 
of ni factors of which the first is x and the successive factors decrease 
by a constant difference. 

Generally, 

^(m) = y „ 2I1) (x — 3/?) ,..{x — ?n— 1//). 

For convenience in working w^c shall take h = i. Then, if 
^(w) = .r (.X~ i) (x — 2) (.r — 3) ... {x — m — i), 

= 1) i)... (x-~m — 2)—x{x— i){x~ 2 )... (x — m—i) 

~mx (x~ i) ... {x~m — 2) 


Similarly — 

and, eventually, — ml. 

Again, from the definition of x^^\ 

= (a* — w + I) 

wTen m = o, = (.v 4- i) 


By convention, = 


or 

When fw = — I 


so that 


I — (x+1) 
x -\-1 * 

= (x + 2) X^~^\ 

I 

^-4-2 (x4-l)(^+2)’ 


Generally 


_ 4 __ 

(a^ 4 - i) Gv + 2) ... {x-h?n) 


I 

(.V + * 


This notation, adopted by Aitken and Milne-Thomson, differs from 
that used in Boole’s Finite Dijfcrences^ Steffensen’s Interpolation and 
Freeman’s Actuarial Mathematics. It has the advantage that, for any 
value of the relation 
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is valid for all integral values of r and w, whether negative, zero or 
positive, provided that division by zero is not involved. 

By proceeding as above it can be shown that 

(-WI)_ _ (-m-f 1). ^2^(~m ):= m {m-\r l) X^\ 

and so on. 

It should be noted that the result of differencing is to 

increase the degree of the denominator, and that, as a result, 
jg constant. 

A special case of is where a? is a positive integer. We have then 

{x—i) {x — 2) ... {x~m +i) = x\l{x — 7 n)\ or 

It is also of interest to note that the result of differencing is 
analogous to that of differentiating x'^. We have 

Ax^"^^ = and Dx^^ = nix^~^^ 

and D^x^=^ml 

Similarly we have 

These relations are very important. 

Note. In the demonstrations above Ax has been taken as unity. If 
Ax = h, then — mhA^x^^'^^~m {m—i) and so on. 

The general principles are the same. 

Example 13. 

Express zx^ — + 2X — lo and its differences in factorial notation. 

Let 

— + 10 = Ax {x— l) (x — 2)-\-Bx {x— i) + Cx-{- D. 

Putting a: = 0, I, 2 in succession, we obtain easily that 
Z)= —lo; C=2\ P = 3. 

By equating coefficients of x^ on both sides of the identity we find 
that A = 2 . 

2X^ — 3^'^ + 3.V — 10 = 2X^^^ + 3JC<2> + 2X^^^ — 10. 

Au^ = 6 x^^^ -f 6 x^^^ -f 2, 

A^u^= I 2 a:^^^ 4 * 6 , 

A®tta.= 12. 


and 
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22. An alternative method for expressing Pn(x) in the factorial 
notation is by use of detached coefficients. By this method any such 
function can be written down in the form 

with very little trouble. 

The principle can best be illustrated by an example. 

Example 14. 

Write down 1+ 5^:^ + 2X- + x-i^ in factorial notation. 

Let 

+ 2x--\-x- I ^ — -f + E 

— Ax (x—i) {x — 2) (.r — 3 ) + Bx {x—i){x — z) 

-h Cx (x - 1) -h Dx + E. 

If we divide by x^ the quotient will be 

I IX^ + 5.^2 + 2 X + I 

and the remainder —15 =E, 

Divide I -f 5^*^ + 4 -1 by jc — i : 

x-i 1 ix'^ + 5x‘'^ 4 - 2x4- I 1 4 -16x 4 -18 

IIX'^— iix- 

i 6 x‘^ 4 * 2 X 
i6x“— i6x 

18x4- I 
iSx— 18 

19 -D. 

Divide 11 x- 4 - 16x 4 - 18 by x — 2: 

X —2 I ix*^ 4 -16x4-18 11x4-38 

11 X 2 — 22 X 

38x4-18 
38X —76 

^ =C, 

and so on. 

The above processes may be appreciably shortened by adopting the 
following procedure: 

(i) omit the x^, x^, x^, ... and work on coefficients alone; 

(ii) change the sign of the constant term in x- i, x~2, x —3, ..., 
so that addition takes the place of subtraction. 
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The required remainders can then be easily obtained, d'hus: 


i II 

5 

2 

I 

j 

11 

16 

18 


16 

18 


! 0 

22 

76 j 

1 

II 

38 

94 


0 

33 



II 

71 



0 





I r 

/. iiAr^ + X - 15 = - 15. 

This short method is the method of detached coefficients and 
is of particular advantage in solving certain problems in summation 
of series (see Chap. VI). 

23 . It is often simpler, and in some cases more practically useful, 
to express in terms of Ar(,.), ... rather than in terms of 

^(r)^ ^(r~i) ^ ^ This can easily be done by evaluating w^, ..., 

forming a difference table and then using the formula given in 
paragraph 15. Thus, using the polynomial in Example 14, we have 


X 


Aw^ 

AX 


A«». 

0 

-15 

19 




I 

4 

207 

188 

426 


2 

211 

821 

614 

690 

264 

3 

1032 

2125 

1304 



4 

3157 






whence = 264jr(4) + 426*x:(3) -f i 88 jC( 2) + 19%) —15. The arithmetic 
can be made even simpler by using w_i, W2 

inserting the constant difference for. the purpose of obtaining the 
leading differences of Wq. In the case of the polynomial considered 
above, the constant difference will be and it is instructive for 
the student to rework the example on these lines. 
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By constructing difference tables, find; 

1. The sixth term of the series 8, 12, 19, 29, 42, .... 

2. 'The seventh and eighth terms of the series o, o, 2, 6, 12, 20, .... 

3. The first term of the series whose second and subsequent terms 
are 8, 3, o, -- i, o, .... 

4. The entry corresponding to the argument 3 from the table: 

X (argument) 5 6 7 8 9 10 

y (entry) lo-i i8-i 29*5 44*9 64*9 90-1 

5. The tenth term of the series 3, 14, 39, 84, 155, 258, .... 

6. Given that y ~ x'^ — 4 - a: 4- 10, verify by constructing a di (Terence 
table that the value of v when 10 is 920. Use the following values 
of I, 2, 3, 4 , 5, 6 and the corresponding values of y. 

, 7. Prove that 4- + A“/q 4- . 

8. Find A^?4^, where u^ — ax^-i^bx^-hexyd and the interval of 
differencing is h. 

9. is a polynomial in v, the following values of w inch are known: 
Wg = W3 = 27; 1/4 = 78; f/g = 169. Find the function u,j.. 

10. Obtain A^^ [(i — ax) (i —hx^) (1 — ca:^) (i -r/v'^)]. 

11. Find and Hence sum the first ten differences 

of 


12. What are the functions whose first differences are (i) x; (2) c®; 

(3) 9 ^^ + 3 ? 

^ 2 - Wa.-( 5 ^ 4 - i2)/(.v-4-5 *v + 6). Find Au^ and A^w^. 

14. U.J,— —{x— i)~i (jr — 2)"^. Find Au,. 

15. Wj = (i2 — Jr) (44-a;); z/g-(5 ~x) (4 —.y) ; M3 = (^ 4 -18) (y 4 - 6 ); 
1/4 = 94. Obtain a value of x, assuming second differences constant. 

16. Find A'^u^, where is (i) ax^-hbx^'-^, (2) 

17. Show that M4 = //o + 4Awo4-6A“//_i4-ioA^z/_ 4 as far as third 
differences. 


18. The first four terms of a series are o, 5, 16, 30. Find the sixth 
term, using the relation in Qu. 17. 


19 - 


Find the value of 


r 4- 
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20. Obtain the function whose first difference is 

+ 3^:2 4-5^+ 12. 

By means of the relation = 4-A)"® find 

21. z/i2 given u^^ = 2; 2/3 = 14; 2/2 = 40; = «4=i57; ^'5 = 258. 

22. 2/6 given 2 /o = 25; 2/^ = 25; 2/3 = 22; = 18; 2/4 = 15; = 15. 

^3- ^9 given 2/0=1; 2/3 == 11; 2/2 = 21; //^ = 28; 2/4 = 29. 

24. The tenth term of the series i, 37, 61, 77, .... 

25. The eleventh term of the series i, 4, 13, 36, 81, 156, 269,_ 

26. Prove that the rth difference of a polynomial of the wth degree 
is a polynomial of the (« — r)th degree if r<n. What happens when 
(i) r = «, (2) r>n} 

27. Define the functions and Obtain their nth differences, 

distinguishing between the cases when n%ni. 

28. Represent the function — izx^ 4-42^:- *- 30A: 4- 9 and its succes¬ 
sive differences in factorial notation. 

29. Find where 2/^. is 

(i) {ax-\-b) {a,x-\- i+h) {a,x -{-2 + h) ... {a,xrn-t-h) given ni >n. 

(ii) [{ax + b) {a.x+ i 4 -/>) (/z.x 4- 2 4- /») ... {a.x + m + b)]-^, 

30. Obtain A sin jc, A tan x and A (jc4-cos x) where the interval of 
differencing is a. 

A2\ A^m. 

31. Explain the difference between ( and and find the 

values of these functions when u^ = x^. 

32. 2/3.==sin^. Show that A-//^, =/?E2/^ where/e is constant. 

33. Prove that A (tan"^ A^) = tan~^ , where h is the 

interval of differencing. 

Use the method of separation of symbols to prove the following 
identities: 
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37. «o + *(i)A«i + *,2, + *,3,A^«3 +... 

= + .r,,, A-«^_i + Ar,2| A^«^_2 + —• 

^ U^X UnX'^ 

38. Wo + - I + , + ^ , +... 

ll 2! 3! 


x'^ 

)/„ + xA?j„ + — A^w,, -f ■ , A».v„ -h... 

2 I 


39 " ^X+l'^^X'^2 * • * “ ^ o 


•3 

2! 



1 * 3/5 

3! 



40. i/gn “«(!)• . 2 “// 2 ,,„.o -...+( ~ 2)^' u, = ( - I)'' (r - 2ari), 

where Uj^ = ax^ -\- hx + c. 


41. * Ax=-x//;?,;f4 - 

42. Find the relation between a, / 3 , y in order that ol + + yx- may 

be expressible in one term in factorial notation. 

43. Sum to n terms 

1.2A.Y'' — 2 . 3 4 * 3 . 4A'\y’^ - 4.5 A^y” 4-.... 

44. II be a polynomial in .y of the third degree and Ay= i, prove 

that ^.(2) 

4 -vA/^o 4 -, AX+ ^X* 

45. Prove that 

UQ + 7 l(^)UyX + //( 2 ) ^2 + • * * 

= (l 4 -.Y)'* Wo-+ ”(i) + 4 -y )”"2 .y-A^w^, 4 -.... 

46. If w be a positive integer, prove that is the difference between 
the two series: 

«(1) “j + (w + I)(:t) AX + {71 + 2),51 A%_] + ... 

(«- l)„)«0 + W(s)AXi + («+ i)(5)AX, + .... 


and 
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INTERPOLATION WITPI EQUAL 
INTERVALS 

Interpolation may be defined as the operation of obtaining the 
value of a function for any intermediate value of the argument, 
being given the values of the functions for certain values of the 
argument. The process has been picturesquely described by Thiele 
as *The art of reading between the lines of a table Where the form 
of the function y — is known or can be deduced from the given 
values, the ordinary algebraic process of substitution can be used 
and the required value obtained with little difficulty. In actuarial 
work the relation connecting the function and the independent 
variable is seldom simple or cvi4ent, and it is then that recourse 
must be had to finite difference methods. 

'^2. Before proceeding to examine the practical aspect of inter¬ 
polation the question of negative and fractional values of n in the 
expression (i -f-must be considered. The proof of the identity 
-f by means of operators or by induction, as in the 
previous chapter, ceases to have a meaning if n is negative or 
fractional, since the reasoning assumes that the argument x advances 
by steps of A at a time. The assumption that, so long as the ordinary 
algebraic rules are followed, the expansion is true for all values of 
the quantities involved and not only for certain specified values, 
is not necessarily tnie, and an analogy can be drawn between the 
application of the binomial theorem to algebraic quantities and to 
operators. For example, the expansion (i is only convergent, 
i.e. is arithmetically intelligible, for negative values of n, provided 
that is numerically less than unity. 

Thus (iI-f + to an 

absurd result if we put x — z^ for then we should have 

(-i)~2=i4-4+i2-f 32 + ... 
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which is impossible, since 

Similarly, in some cases there is no possibility of expanding 
(i "f A)” u.^ by the use of the binomial theorem. 

Consider the two following series of corresponding values of 
X and w p: 

(i) .r o 1 2 3 4 5 

Hr I 4 9 25 36 

Then to find the value of, say, we shall have 

= 1 A?/() 4 -" ^ 2!”^ + • ■ • * 

The leading differences are Az/() = 3 and higher differ¬ 

ences being zero. 

Wi=i + L3+"-";,- ■ *2 

which is otherwise evident, since is (i I .v)^, and, for the value 

= (1)^ = 4- 

(ii) ;c o I 2 3 4 5 

I 5 25 125 625 3125 

This is evidently a geometrical progression and the function 
from which the values are derived is = If we attempted 
to express as a polynomial in x so that 

--ahx 4- cx^ 4- dx^ f ..., 

and then applied the relation £^%=(i4A)^ Wq for the value 
n “ I, we should obtain as above 

u^^=Ehl^={l+^f Mo 

Here the leading differences are i, 4, 16, 64, 256, ... and tend to 
become successively larger. But Mj = 5^ = 2'24 approximately, and 
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this cannot be the same as the divergent series found by expanding 
(l+A)iMo- 

Hence unless the function is capable of being expressed as a 
polynomial in x we cannot use the relation = for the 

value « — 

Let us now consider the problem more generally. If be a 
polynomial of degree k in .v, we may write 

Uy. = a-hl>x-i- cx^ + ... + px^. 

If we adopt the binomial expansion of (i + for expressing 
in terms of Uq and the leading differences of w^, namely, 

+ ‘^’( 1 ) + ... 
we have two series for which are equivalent for more than k 
values of the variable, since they are true for all positive integral 
values of x. 

Hence by a well-known algebraic theorem, they are true for all 
values of Xy positive or negative, integral or fractional. 

Therefore so long as is a polynomial in x the binomial expan¬ 
sion is valid for all values of .v. It is not necessarily valid for other 
forms of function, and we are led to the conclusion that we can 
expand in terms of A?/,., ... A^w^, ... for all forms of 

the function if w be a positive integer, but for other values of n only 
if is a polynomial. (Cf. Chap. I, paragraph 17.) 

^ 4 . All finite dilTerence formulae which are employed for the pur¬ 
pose of interpolation are based on the hypothesis that the functions 
in question can be represented by polynomials, i.e. by rational 
integral functions, with sufficient accuracy for the purpose in hand. 
This assumption is the justification for extending the formulae 
to fractional intervals: the processes to be explained in this and 
subsequent chapters are simply various methods of carrying out 
the calculations based on these assumptions. One special advantage 
of these methods is that it is unnecessary to fix in advance the 
degree (n) of the polynomial; the interpolation formulae will be 
in such a form that to increase n will merely involve the intro- 
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duction of a fresh term without affecting the other terms. The 
introduction of additional terms however will not necessarily im¬ 
prove the approximation or justify the assumption, although it will 
generally do so. Fortunately, in actuarial work the functions with 
which we deal are usually such that the assumption is sufficiently 
accurate for our purpose. 

In applying the formula in paragraph 3 to a given set of data 
the following points should be noted: 

{a) If the basic curve is y a bx + ... + there will 

be n constants, and in order to determine these constants n equa¬ 
tions are necessary. For there to be n equations, values of y 
corresponding to n values of x must be given. Therefore either n 
points on the curve, or n other corresponding relations between x 
and jy, must be known. In that event the curve will be of degree 
I, and nth and higher differences are zero. 

{b) Our investigation has been confined to equidistant values 
of the argument. If the given values are not equidistant a formula 
slightly different in form from the expansion can be 

developed with a modified method of differencing (sec Chap. III). 

With regard to the statement (a) above that for a curve of degree 
n — i there must be n facts given, it is not essential that n points on the 
assumed curve must be known. We may have given, for example, three 

dii 

points and two values of the differential coefficient , Here w^ have 

five facts; we assume therefore a fourth degree curve, so that fifth and 
higher differences are zero. 

6. Newton’s formula. 

The formula = + + ... is known as 

Newton’s formula, and is the fundamental formula for inter¬ 
polation when the given values are at equidistant intervals. The 
expansion can be applied to solve many forms of the problem of 
interpolation. 

The following variations of the problem may arise: 

(i) Where there are n equidistant terms and it is required to find 
an intermediate term. 
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(ii) Where there are n equidistant terms of which n—i are known 
and it is required to find the missing term. 

(iii) Where there arc n equidistant terms of which n — r are 
known and it is required to find the r missing terms. 

Note. Some modern writers have adopted the name “Newton- 
Gregory formula” for the above expansion, as the first publication 
appears to have occurred in a letter from James Gregory to John Collins 
on 23 Nov. 1670. The letter is given and the question of Newton’s 
priority is fully discussed by D. C. Fraser inJ.J.A. vol. Lii, pp. 1 17-35. 

/ 7 . Examples of the variations referred to above are given below: 
they are all solved by assuming the last difference constant. 

Example 1. 

The values of annuities by a certain table are given for the following 
ages: 

Age X 25 26 27 28 2C) 

Annuity-value 16-195 15*919 15*630 15*326 15-006 

Determine the value of the annuity at age 27 J. 

Five values are given: we must therefore assume that fourth differ¬ 
ences are constant. The difference table is 


X 





AVi^ 

25 

16195 

— -276 




26 

15-919 

— -289 

-•013 

~ -002 


27 

15-630 


--•015 


+ -OOI 

28 

15-326 

-•304 

— -016 

— 'OOI 


29 

15-006 

-•320 





The leading differences correspond to the argument x = 2^ and we 
require the entry for age 27^. Our formula is therefore 

<225 = «25 


, 25 2-SXI-5X-C,, 

I + 2 -5A+ . - A 2 + ^ i— ^ A® 


I 2-5 xi-5x -5x(--5) 

24 

= <225 + 2-S'i‘<225 + I + ‘3 “ ■039o6A«<226 

= 16-195 - -6900 — -0244 — -0006 — -00004 
= 15-480. 


a 


25 
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Note. Since the data are given to three places of decimals, sufficient 
figures have been used to give three places only in the result. Since our 
interpolation is based on an assumption, namely, that fourth ditlerences 
are constant, a result to more than this number of decimal places would 
be unjustifiable. 


Example 2. 


From the following data find the value of w.jy: 

M4e= 19-5356; k.,9= 19-6513;'"^.^^= 19-7620. 


We cannot form a difference table, since the given terms are not 
equidistant. As however four terms are available we may assume that 
third differences are constant, and that as a consequence fourth differ¬ 
ences are zero. 

If the function is we assume therefore that = o whatever 

the value of x. We may write 

A%4e = o, 


i.e. {E-if 

or (E^ — 4 4 - - 4^* + i) ^^40 = o, 

i.e. — 4^^49 d* 61/4,— 4^47 d" ~ o, 

so that 197620-78*6052 4 -117-2136 ”4m,i7 4 - 19-2884 = 0, 


from which 


/^47 = 19-4147. 


Note. As mentioned above, all the given terms are not equidistant. 
I'he method however depends upon the fact that the term required 
makes up in all five equidistant terms. 


Example 3 . 

Complete the following table: 

X 2*0 2-1 2*2 2*3 2*4 2-5 2-6 

*135 -III *100 -082 -074 

This is similar to Ex. 2. Instead of using the assumption once that 
Nu^ — Oy we write down two equations of the same form, thus 

A"’z/2.o = o> so that (£’ - i)^ «2-o = o, 
and A^M2 *i~o, „ {E-if 

Our two equations then become 

^2-5 “* 5^'a'4 + lOtta.g - IOW2-2+ 5«2-1 - ^'2-0 
and W2-6 - 5 " 2-5 + IOW 2-4 - + 5 ^^ 2-2 ~ “2*1 = o, 

since the interval of differencing is o-i. 
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Inserting the known values of and solving, the required values are 
easily found to be u^.i = *123 and 1/2.4 = *090. 

Note. The function in this question is jy = and the tabular value 
of W2.4 is *091 correct to three decimal places. This difference is due to 
the fact that our assumption that the curve y — Uj. is a polynomial of 
the fourth degree in x is only approximately true. 


^ 8. Change of origin and scale. 

If we had plotted the curve = on which the values of in, 
say, Example 2 tvcre assumed to lie, we should have had values ofy 
corresponding to values of x at 46, 48, 49, 50. Precisely the same 
curve would result, however, if we changed the origin of our co¬ 
ordinates so that 46 was represented by the value x~o, 48 by a; = 2, 
49 by x ~2 ^ being unaltered. This process 

of changing the origin simplifies our notation considerably. In the 
examples above w’e could have changed the origin alone, or both 
the origin and scale, and could have altered the questions to read: 


Ex. I. Origin at age 25 

Unit of differencing i year of age 
Ex. 2. Origin at ^^ = 46 

Unit of differencing x=i 
Ex. 3. Origin at x — 2’0 

Unit of differencing x—i 


Given //q, w^, Wg) ^4 
Required 

Given Wq, W2» ^hy 
Required 

Given Wg, Wy, Wg, Wg 
Required /q and 


c 9 . If in Newton’s formula 

«x^ nh. = «x + + «(3) + ■ • • 

we put h=i, x — Oy and replace n by jc, we obtain the series 

7/P = T/q + *^(1) + ^’(2) + • •' • 

This is generally called the advancing difference formula, and gives 
in terms of Uq and its leading differences, where the interval of 
tabulation is treated as the unit abscissa. 

If, however, we wish to obtain in terms of and its leading 
differences, we may write the formula 

M* = U-m+hn+x) = = (> + 

= + (ot + X) (1) + (»I + X) ( 2 ) + ... 

+ (m + x)(,) + 
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It is often more convenient to use this formula than to obtain 
in terms of Uq and differences of the advantage being that thereby 
we can make use of values of the argument on either side of x. 

In some cases, particularly when dealing with the summation of 
certain series, it is expedient to represent the first term by W], rather 
than by Wq. If in Newton's formula we put // — i, x—i we have 

= i)(i)A2/i + (a:- i)(2)A2«j+ ... 

or = + + 

and these formulae can be used for the summation of the series 
W]-f W2-f W3+.... (See Chap. VI, paragraph 2.) 

yT 10. Subdivision of intervals. 

A frequent problem in actuarial work is the interpolation for 
values of at intervening points given every fifth or tenth value of 
the function. For example, the problem may be to complete the 
series u^y Uc^y ... from the known values w,,, z/5, u^^y ... or 
from //q, ^20’ ^20’ 

A simple method for obtaining the intervening values where 
quinquennial values are known is given below. 

l^et denote the difference for unit intervals of x and Am^ 
denote the difference for quinquennial inten^als. 

Then u^^^ may be expressed as either (1+8)^ u^ or as (1+A) u^. 

Symbolically (i -f 8 )^ = i + A, 

i.e. (1 +8) ~(i -f A)^’, 

or 8s(i+A)^ —I. 

From this relation we can find easily that 

hi^ = (’2A - *o8A^ + *048A-^ - ...) 7/^. 

Hence 8‘-^w^ = (•2A — *08A^-f *048A ^~ ...'f u^ 

= (-04A2--o32A^+...)k^. 

Similarly 8®w^ == (-ooSA^ —...) 7^^. 

The same principle can be adopted if decennial values are 
known. In that event Am^, A%^, ... will represent differences for 


FM A S 


3 
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decennial intervals, and the individual differences will be found 
from the identity 8 = (i -f - i. 

An example will show the application of the method. 

Example 4. 

From the following table of yearly premiums for policies maturing at 
quinquennial ages, estimate the premium for policies maturing at all 
ages from 45 to 50 inclusive: 

Age a: 45 50 55 60 65 

Premium 2-871 2-404 2-083 1-862 1-712 

The leading differences for quinquennial intervals are 

Auj. Ahij. 

— •467 +-146 —-046 -f-017 

'Phe formulae required are 

8z/^ = (-2A ~ •o 8 A‘‘^ 4 -*048A-^ —'0336A^) —-1078592, 

82 w^ = (‘04A*‘^--032A^ + -0256A‘^) 1/^,= +-0077472, 

-- (-ooSA^ - -ooqbA^) = — -0005312, 

= +*0000272, 


assuming fourth differences constant. 

We have therefore by completing the table of differences, 


Age 


SMsc 



45 

2-871 

--10786 



46 

2-763 

— -1001 I 

+ -007747 

— -0005312 

47 

2-663 


+ -007216 

+ -0000272 



- -09290 


— -0005040 

48 

2-570 


+ -006712 

+ •0000272 


— •08618 


- -0004768 

49 

2-484 

- -07995 

+ -006235 


50 

2-404 





Note, Since we require the value of the premium to the nearest 
penny, three decimal places will be required in the column. In this 
example, for results correct to three figures, four decimal places will be 
needed: must therefore be given to five decimal places, B^u^ to six 

and B^Ujg to seven, since errors in higher differences accumulate rapidly. 
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G iven the following data (a), find the missing term or terms (6): 

I- (a) Mo = S8o, Mi = S 56 , «2 = 52 o, a 4 = 385 ; (b) a-j. 

2. (a) ai = 386, a3 = 53o, a5 = 8io; (b) a^; a^. 

3. (a) ao=iso, aj = i92, a2 = 24i, a4 = 374; (h) 11.^. 

4. (a) aj = 94, a3 = 265, a5 = 4i5; (b) u^', u^. 

5. ( a ) a„ = 6o2i, ai = 5229, aj, = 4559, U 3 = 2979 ', (*) "J- 

6. (a) a5o = 92345, a5i = 9i556, a52 = 90748, a55 = 88204; (i) a^s; a^. 

7. (a) a_j = 202, ao=i75, aj^Sz, a2=:55; (6) aj. 

8 . (a) ao = o,ai = 3,a2=io,a3 = 34,a5 = 209,a8=ioo2; (/?) a,,;ag;a7. 

9. (a) ao= 192-1, a4=: 187-5, "2=184-7, a3= 184-6, ag^ 194-6, 

«6=i 99‘4. "7 = 212-7, "9 = 224-3; (6) ag; ag. 

10. (a) "0 = 98203, "1 = 97843, "2 = 97459, "3 = 97034; (b) u^.^. 

11. The numbers of members of a certain Society are as given in the 
following table: 


Year 

Number 


1910 

845 


1911 

867 


1912 


Miike the best estimate 

1913 

846 

you can of the 

1914 

821 

numbers in 1912 and 

1915 

1916 

772 

1916 

1917 

757 


1918 

761 


1919 

796 



12. Find />53 if pr^ = -98428, /> 5 , = -98335, ^ 54 = -98008, />s 5 = -97877. 

13. If "0, "i, "2, ... "g be consecutive terms of a series, prove that, 
if fifth dilferences are constant, 

"3 = -O5"o - -3"! + •75«2 + + -oS^'d* 

Supply the missing term: 

"0 = 72795 "4 = 67919 

"1 = 71651 "5 = 66566 

"2 = 70458 "8 = 65152. 
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14. «235 = 2-37107 “237 = 2-37474 

"236 = 2-37291 "238 = 2-37658- 

Find Mm 5 . 63 . 

15. Given u^ = —-5, Wj= --484, 2/5 = 0, 1/5 =-256, find the missing 
terms. 


16. Given the following data: = 15, %, = 50; estimate z/js- 

If you were given in addition z/., = 35, how would your estimate be 
revised? Illustrate your answer by a diagram. 


./ 17- 
table : 


Find the value of an annuity at 5I per cent, given the following 


Rate per cent. 

4 

4i 

5 

52 


Annuity-value 

17-29203 

16-28889 

15*37245 

14*53375 

13-76483 


18. Obtain approximations to the missing values: 

50 51 52 53 54 55 

f{x) 3-684 3-756 3-780 3-803 3-826 

19. The area ^ of a circle diameter is given for the following values: 


d 80 85 90 95 100 

A 5026 5674 6362 7088 7854 

F'ind approximate values for the areas of circles of diameters 82 and 
91 respectively. 


/ 20. Calculate the value of sin 33'' 13' 3c" from the following table of 
sines ; 


angle 30 31 

32 

33 

34 

sin x" -5000 *5150 

-5299 

-5446 

•5592 

21. ^75 = 2459; «go = 2oi8; z/s5 = 

= 1180; 

= 402. 

Calculate the 

values of ^79- 





22. P'rom the data in Qu. 21 complete the table for values of 
corresponding to individual values of x from 75 to 85. 

23. Four values of a function at decennial points are given. Express 

huy., (the differences for unit intervals) in terms of the differ¬ 

ences of the function for decennial intervals. 

Find the values % to inclusive, given Mo = o, «io = 'i74, %) = ‘347> 
«3o = -5i8. 
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24. Mo = 23-i 234; «« = 237234; ai2 = 24-6834; Wi8 = 26-i330. Com- 
plete the series to Mg. 

25. If you were asked at very short notice to obtain approximate 
values for the complete series /(o),/(i), /(z), .../(20), being given 
that /(o) = 'Oi3, /(io) = -248, /(i5) = - 578 , and /(2o) = -983, what 
methods would you adopt, and what value would you obtain for/(9)? 

26. +2/8=1-9243 2/2 + 2/6=1-9823 

+ w? = I *9590 W3 + W5 = I -9956. 

Find 2/4. 

27. Tables are available giving premiums at age 40 at the following 
rates per cent.: 

Rate per cent. 3 3I 4 4I 5 6 

^40 •025891 -024654 -023517 -022470 -021509 -019811 

It is desired to obtain at 5J per cent. Obtain this, using 

(a) two of the above values; (/ 3 ) four of the above values; (y) six of 
the above values. 

28. Given 

10 10 10 

- /(■^’) = 500426, ^ /(.r) = 329240, i:/(x) = i752i2 and/(10) = 40365, 

1 i 7 

find/(i). 

29. = 2/2 + 2/3 = 5*41; 2/4 + 2/5 + 2/6=18*47; 

2/7 + 2/g + 2/9 2/jo + 2/]i + 2 /i2 = 90*36. 

Find the value of 2/3. for all values of x from i to 12 inclusive. 

.r-10 

30. If you were given 2/9,2/3, and how would you complete 

x~ I 

the table of up to 2/30? 

31. Given 2/0=117-7; 2/2=110-5; 2/4=102*7; 2/30 = 75-4, obtain the 
values of Uj, for all integral values of x from o to 10. 

32. Obtain the following relation between nine terms of the series 
represented by 2/3, u^y ... 2/9: 

«6 = I («4 + ««) -1 (“S + «7) + U + «8) - tV («1 + «9). 

and find 2/5, given 

“1 = 74556; «2=-55938; a3 =-42796; M4=-32788; a8 =-18432; 
«^=:-i3i65; 2/8 = -08828; 2/9 = 0. 
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33. It is asserted that a quantity, which varies from day to day, is a 
rational and integral function of the day of the month, of less than the 
fifth degree, and that its values on the first seven days of the month are 

30, 30, 28, 25, 22, 20, 20. 

Examine whether these assertions are consistent. If so, assume them to 
be true, and find (i) the degree of the function, (2) its value on the 
sixteenth of the month. 

34. Extrapolation may be defined as the process of obtaining further 
terms of a series as opposed to interpolation, which is the process of 
finding intermediate terms. 

The values of a certain function, corresponding to the values 4, 6, 8, 
10 of the argument are 914, 742, 605, 500 respectively. Extrapolate to 
calculate the value of the function corresponding to the value it of the 
argument. 

35. Given Mo=i876, Wi = 777, 1/3=19, 1/3=— 218, interpolate the 
values of Wgj «4 and and find the form of the function, assuming it 
to be a rational integral function. 

/ 36. Show that Newton's formula 

= Wo + Awq + ^(2) + ^(3) + ^(4) + • • • 

can be put into the form 

U.J. -■= Uq + A//o — xaAhiQ + xabA^u^) — xahcA^u^ -j- ..., 
where w=:i-J(^+i), ^ = i - J (A?-f i), i —| (;c-f i) etc. 

Hence show that the successive coefficients converge slowly and 
tend eventually to numerical equality. 
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INTERPOLATION WITH UNEQUAL 
INTERVALS 

1 . In the previous chapter formulae have been developed on the 
assumption that the argument proceeded by equal intervals. 
Although in actuarial problems the data are generally given at 
equidistant intervals of the independent variable, it sometimes 
happens that we are required to interpolate when values of the 
function are known for unequal intervals. In other words, instead 
of values of for arguments jc + //, x-\-zh^ x+i^hy ... being given, 
the known values correspond to the arguments h, c, ..., where 
a —by b — Cy ... are not necessarily equal. 

2. Divided differences. 

Since we cannot use the differences as hitherto defined, we 
adopt a special process involving the argument as well as the entry. 
'The differences obtained by this process are called “divided” 
differences, and are found in the following manner. 

Let f {x)^Uj. be given for the values x — Uy x = b, x — c, x==d ..., 
where the intervals need not be equal. 

Then we have 

(i) First divided differences: 

b^a ’ c-b ’ d-^c ^ 
which may be written as 

fib, a); fic,b); f{d,c); .... 

(ii) Second divided differences: 

f{c,b)-f{b,a)_ f{d,c)-fic,b)^ 
c-a ' d-b 

f{c, b, a); fid, c, b); .... 


or 
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(iii) Third divided differences: 


/(rf, c, b)-f{c, b, a) 
d-a 


or f(dyCyb,a)] .... 


3. Notation for divided differences. 

There seems to be at present no universally recognized notation 
for divided differences. There are objections to the many forms 
in practice. For example, representing (Uf^ — u^)/(b — a) 

(Freeman’s Actuarial Mathematics, p. 57, and Henry’s Calculus and 
Probability, Chap. VIII), has the disadvantage that the dash is apt to 
be confused with the index (cf. and A preferable form 

A {a, b), A^ {a, b, c) is clearer, but departs from the recognized 
symbolical notation in that the function on which the divided 
difference symbol operates is not present; this renders the notation 
unsuitable for elementary work. The method adopted in paragraph 2 
above is simple, and where there is no ambiguity with f{x,y)y 
representing a function of two variables, there is no objection to its 
use. This notation does not, however, conform with the A notation 
for differencing when the intervals are equal and to that extent it 
is hardly satisfactory. 

The following are some alternative notations that have been used: 
Arguments Function Divided differences 

(1) a, b, c ... Ua, Wft, Wr A {a, h), A {b, c). A- (a, b, c )... 

(2) a,b, c ... u . (a, b), (b, c), (a, b,c) ... 

(3) ‘Vo, V3,.. /(vo),/(^i),/(v2)... 

(4) ^ly -^2 ... f {Xq), f {Xii)yf [X2) ••• [Voj'V^], [Vj, ^^3], [Xq, X^ ... 

( 5 ) ^Qy ^ 0 > ^2 ••• ^^^Qy ^Ai, 0 “Aq ... 

(i) D. C. Fraser, (2) W. F. Sheppard, (3) J. F. Stefl'ensen, 

(4) Milne-Thomson, (5) De Morgan. 

The notation that we shall adopt is due to Dr A. C. Aitken 
{Proc. Roy, Soc, Edin. vol. LViii, pp. 169 and 175) and has all the 
advantages possessed by the ordinary difference symbol. 
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The convention and definition are 

b 

a-b b-a' 

A^Ma = A (A«„) 

be C b 

a~c\a~b h-~a\ c — a\c~-b h — c] 

^ .. W,,_ Uf, _ __ 

{a — b) {a — c) {b - c) (b — a) {c — a) (c — b) ’ 

Similarly it may be shown that 

““ “ \a^-l}Y{a'^)~{a-d) + {b-~c)(b^ih -aj 

, ___ J!r _,_ »<l 

{c-d) (c- a) {c-b) {d-a) (d- b) (d~ c)' 

The symmetry of these differences is apparent and suggests 
that the property holds for a divided difference of any order. 
This is in fact true and the general proposition may be proved in 
many different ways. 

It may be proved by induction, by actually arranging the co¬ 
efficients or, incidentally, in establishing another important pro¬ 
position, namely that if is of the form (x) then A^Uj^ is 
constant (see paragraph 6). 

An elegant and succinct proof, based on the permutability of 
the arguments, is due to Dr Aitken, and is given below. 

* 4 . By definition 

AUr = {ii^-uMx-a) 

a 

^(u„-u^)/{a-x) = Au„. 

X 

The suffix of the operator and the argument of the operand are 
therefore interchangeable. Also, if is of the fonn (jc), by the 
remainder theorem ;c —^ is a factor of u^ — u^ and it follows that 
is of the form (x). 
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For divided differences of higher orders the definitions are 

A (AMa:)=A^x 

b a ab 

A (AX) = AX. 

c ba abc 

and so on. 

A divided difference of any order, e.g. is unaltered by any 

abc 

permutation of the arguments. For, since the suffix of any A 
operator and the argument of the operand are interchangeable, we 
have such interchanges as 

A (AwJ = A A (Am^i) 

b a b X ax 

= A (AMx)= A (Ai/„) 

a b X b 

= A (AjO- 

X a 

All permutations can be obtained by successive single inter¬ 
changes. Thus, in the illustration above, 

hax -> bxa ->■ axb abx xba -> xab. 


These interchanges are obviously possible for divided differences 
of any order and it follows therefore that a divided difference is a 
symmetrical function of all the arguments involved. 


Note. The proof given above depends entirely on the symmetrical 
property of the differences. It does not follow that if Uj, — Vy then 
Am«= A^^y In passing from, say, A (A^J to A (A^s) the student 

a a a b a X 

should satisfy himself that the equivalence holds by writing down 


A (A«x)=-7 

a b X 


a [ x — b 


a-b j 


_ Ux _ _^____ _ 

{x — a)(x--b) {b—x)(b — a) (a — b){a — x)^ 


A (A«6) = 

a X 


I 

b — a 


( b—x 


U a~U^ \ 

a—x ) 


__ ,__I Ug 

(b — x){b — a) {x—a){x — b) {a — b){a—xy 
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5. The method of forming a divided difference table is best 
illustrated by an actual example. 


Example 1, 

Take out the divided differences of given the following table: 


X 

Ur 


124 J 12 

22 30 82 106 206 


The table is 


X Uj. 

I 22 


2 30 


4 82 


7 106 


2—1 




26-8 _ 
4-1 " 


AV 




4-2 

m6 — _ 

7-4 


= 26 


-3-6-6^ 


7-1 


= -i-6 


8-^ 

7~-2 


= -36 




192. 


1-5- (- 3-h) 
12-2 


= •51 


20 — 8 


206 — 106 

-= 20 

12 — 7 


12 206 


Note, When the arguments and their order are fixed, we may use 
the shortened notation A^xj A^*^x> A^w^. ... for the leading divided 
differences of Mg., as in the example above. 


It is essential to arrange the work systematically if error is to be 
avoided. The numerators and denominators must be set out, either 
in parallel columns or in the form of fractions. Where there is 
ample space the columnar arrangement is better, especially where 
the divisors are cumbrous. It should be noted that while the 
numerators are the first ordinary differences of the preceding 
divided differences, the denominators are all formed directly from 
the arguments, differencing first in the ordinary way, then in pairs, 
then in triplets, and so on. It will also be seen that the divisor 
is always the difference between the values of x for the last and 
first involved in the difference. 
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6 . Newton’s divided difference formula. 

Let the given values of be 

Mfo, Wc ... «*, M,. 

Then, by definition, u^ = u^ + {x-a) 

X 

A Ua = A M« + («-*) AX. 

X b bx 

AX = A^«„ + (jf - c) AX. 

bx be hex 


A” «„= A" Ma + (Ar-/) A"+h/„. 

bc...kx bc...kl be.,.lx 

Hence, by successive substitution of each identity in the one 
preceding it, we have 
Kr = Wa + ('^-^) + {x-h) 

b he 

4- (.r — a) (x -b) (x — k) -f-(x-~ a)(x — h)... (x ~ 1) A” 

be ...l be...lx 

Thus «:r=--X + '^..+l(^). 

where represents the sum of all the terms except the last and 
Rn+i (^) the last. 

^?i+i {^) be written as 

{x — a) {x — h) ... (x — /) 

abc ...I 

by permuting the arguments of the divided difference. 

It should be noted that the term preceding R is 
(x - a) {x ~b)... {x — k) 

be...I 

involving n factors in the coefficient. In R there is an additional 
factor x — l. 

If we put X equal to a^hyC... in succession, the R term vanishes and 
we have the following results which are identities, analogous to those 
obtained for equal intervals in Chapter I: 

[ O’*-' h ^ ti) Wfl , 

b 

= = ZhWa + (^r--rt) {c - h) 

b be 

[U^]^^^d = u^=--u^ + (d-a) AUa + {d-a) (d-b) 

-\-{d-a) [d- 'b) {d-c) AX- 
bed 

It is seen that R is the only term involving ; it vanishes if x takes 
any of the values a^byC ... or for any value of x if A^'^X == c>- 
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It follows therefore that is that polynomial of the nth degree 
in X which takes the (n-f i) given values ... Ui, when 

x — a^h^c.,.l respectively. If therefore is itself a polynomial 
of the nth degree in a: it must be the polynomial \ i.e. R vanishes 
for all values of We have therefore that 


abc ...I 


and 


is constant. 

bc..J 


U^ is thus the unique polynomial of degree n which represents 
.... Therefore the coefficient of does not depend on the 
order in which ayb^c^,.. are taken. Since this coefficient is A^^w^, 
it follows that the divided difference is a symmetrical function of 
all the arguments involved. (See paragraph 3 above.) 

If Uj. is not a polynomial of the nth degree in we have only the 
relation rr . d / \ 


where the R term itself involves and if the expression 
(a;) be worked out by expanding the coefficients all that 
we obtain is the identity u^==Uj,, Thus our results do not help us 
to find Uj. unless we have some knowledge of or may make some 
assumption with regard to the value of Rj^^i{x). It may be said 
in fact that Rf^+i (x) plays the same part as does in Chap. I, 
paragraph 17. As in the case of ordinary differences it is assumed 
that Rn+i (x) is negligible and may be put equal to zero, giving 
finally ^. 

= «a + (^-«) AUa + {x-a) (x-b) 

h be 


+ (x--a) (x — b) ... (x-k) A”«a> 

be... I 

which is Newton’s formula for interpolation with divided dif¬ 
ferences. 

For the method of obtaining the limits between which (:v) 
lies, see paragraphs 16 and 17 below. 


7. Sheppard’s rules. 

If, as in the paragraph above, the arguments are x, a, b, c ky I, 
where is to be found and the n + i values w^... Uj^y Ui are 

given, we may put Newton’s formula into a more compact form 
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by the use of a notation due to Mr D. C. Fraser. This notation is as 
follows: 

x — a^A, 
x — b^By 
x-c^Cy 


The arguments a, by c involved in the divided differences and 
the factors A, By C are in the same order, with the exception 
that the small letters are always one in advance of the capitals, 
thus: 

Arguments, a ab abc abed abode .... 

Capitals, I A AB ABC ABCD .... 

Newton’s formula thus reads 

+ ABC/t^U^ + ... + ABCD ... K 

b be bed be... I 

Dr W. F. Sheppard has given very convenient rules, embodying 
the same principle, and by means of these rules formulae may be 
written down at sight for any intervals. As expanded by Mr G. J. 
Lidstone [Jf.I.A. vol. lviii, p. 65 and references there given] these 
rules are as follows: 

“(i) We start with any tabulated value of u. 

(ii) We pass to the successive differences by steps, each of 
which may be either upward or downward”; 

[each step involving a new u whose subscript will be numerically 
the next lower/higher if the step is up/down, and the w’s are 
arranged in the numerical sequence of the variables.] 

‘*(iii) The new suffix [of w] which is introduced at each step 
determines the new factor (involving x) for use in the next term.” 

[That is, each divided difference of the «th order has for its 
coefficient the product of n factors of the form {x — aj^ where aj^ 
represents a value of the variable, and has to be given all the n 
values that were involved in the last preceding difference. 

These rules apply whether the intervals are equal or unequal; if 
they are equal the divided differences are of the form A^^w/«!.] 
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These rules are familiarly known as the ‘'zig-zag rules” for 
reasons which will now be explained. If we are given a set of 
and their differences as in the following scheme 




Ud 


AUa 

b 

be 


Mih 



C 

/ 

/ 

bed 

AUc 



d 

de 

ede 


Mtfa 

^bcde 


and use them to interpolate for 7/^, it is not necessary to use the 
formula involving and its leading differences. We may begin 
with any of the u's and at each step move either upwards or down¬ 
wards in proceeding to the next column in the difference table, so 
that we may if we like follow a “zig-zag” route; in fact with n 
values we have a choice of 2^*“^ diilerent routes. [Cf. Sheppard, 
J.I.A, vol. L, p. 89,] 

Suppose we wish to begin with The next term may be eitl jer 
AW5 or since both of these involve u^: let us take in- 

c d c 

volving Uf. and We may next take either or since 

be cd 

both of these also involve and let us take A^w^^. We may then 

cd 

move to A^w^ or and we select the latter. We then have no 

bed ede 

further choice and we move to Our formula thus brings in 

bede 

the small letters in the order 

Cy by dy Cy U 

and therefore the factors of the coefficients will enter in the order 

I, Cy By Dy E. 
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Thus we may write down at once the required formula 

+ + + BCDA^Uf^ + BCDE/^u^ 

c cd cde bcde 

or 

«x = «f + {x - f) AM(, + (x -b)(x- c) A^Kf, + ( v -b)(x-c) {x - d) A®«i, 

c cd cde 

+ {x-b) (x-c) (x-d)(x-e) A}fu„. 

bcde 

Sheppard’s rules are of the utmost importance, and a clear 
understanding of them will save the student much trouble. 

8. The following is an alternative proof of the property of divided 
differences which was established in paragraph 6. 

If be a polynomial of the wth degree in then the wth divided 
difference of is constant. 

It will be sufficient to consider the function y = Then, if the 
values of the argument a; be a, c the first divided difference 
is 

b 

— a^')j{h — < 2 ) = 6^"^ -f 6''“ 2^7 + + ... + 77 ''“ h 

This is a symmetrical function of the {n — i)th degree in a and 6, 
and is the coefficient of in the expansion of 

(i -f + ...) (i +77V-f-aV-h ...), 

i.e. in the expansion of — ^ - -. —-- - . 

^ i~bx 1 —ax 

Thus A^” is the coefficient of in - .— ~y~ and therefore 

c I — cx I — bx 

A^i^ is the coefficient of x^~^^ in 

be 

c~a\i—cx 1 ~bx i—bx i — axj ’ 

i.e. in X I - 1, or the coefficient of x^~^ in 

\i“CJC i — bx i—axj 

I I 1 

I — ax' 1 — bx’ 1 — ex' 

Proceeding in this way, we find that is the coefficient of 

be,.. 
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—i— — L — I — where there are r factors in the 

1 — ax i—ox i—cx 

product. This coefficient is evidently symmetrical in 6, c — 
Finally, /^a'^ is the coefficient of x^ in a similar expression, and 

be.,. 

this coefficient is unity. 

Thus, the wth divided difference of a polynomial of the nth degree 
in X is constant and equal to the coefficient of the highest power 
of x; and higher divided diffeiences vanish. Also, it has been 
proved incidentally that a divided difference is a symmetrical 
function of the variables involved; for the expression obtained for 
dfa'' is symmetrical in a, b, c .... 

bed... 


9. Newton’s divided difference formula is quite easy to apply in 
practice, as the tollowing example will show. 

Example 2. 

From the data in Ex. i, find 

Assuming fourth divided differences constant, the formula gives 
Wp=:zq4-(8-i) T/q-f(8-i) ( 8 - 2 ) 

+ (8~i) (8-2) (8-4) d\X+(8-i)(8- 2) (8-4) (8-7) 

- 42 d + 168 -f-1 hS 

= 22 + 564-252 — 268-8 4-32*3 
= 93 to the nearest integer. 


10. Relation between divided differences and ordinary differences. 

If the arguments a, b, c ... k, I are spaced at equal intervals A, 

A — /;~a = r — = J — c=...=/ — A. 


we shall have 


Hence in fonning /I\w the divisors will all be equal to h. 

Thus d\ tfa — 

b 

Similarly in forming AiU the divisors c — a^ d — b ... will all be 
equal to zh. 


! Jr. 

be 


In forming /^u the divisors d-a, e — b ... will all be equal to 3A. 


F M A S 


4 
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A:^Ua = A.^uj2lh\ 

bed 

In general A”«a= A.'^ujn ! A”. 

bed ... 

If therefore the common interval h >' unity, the divided differences 
will diminish much more rapidly than the ordinary differences. 
In interpolation formulae this is counterbalanced by the denomi¬ 
nators of the form n\ which appear in the coefficients of the 
o-rdinary differences but not in the coefficients of the divided 
differences. 

The formula for in terms of and its leading divided 
differences is 


= + + {x-b) + 

b be 

+ (x — a){x — b) ... (:v - /f) 4 

be,.A 

This becomes, on putting x — a — nhy 

^a^ = «)] + • * .. 

b be 


If now b — a = hy c — a — zh ... 

nh^u^ nh (nh — h) 

^a+nh == ^ tAIi^ 

nh {nh — h) {nh — zh) A®w^ 

+- JO? -+ ••■ 

from the relations proved above; 

i.e. ^a+ri h f" ^^(1) ^(2) ^ "h ^(3) A’ , 

which is Newton’s formula for advancing differences. 

It is easily seen therefore that in order to pass from the divided 
difference formula 

Ux = u» + (^-^) Atia + (x-a) (x-b) AX + ... 

b be 


to the advancing difference formula (when the intervals are the 
same) we may replace A by A, drop the subscripts and insert 
factorial denominators thus: 




{x — a){x — a—i) 


AX + --- 
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11. Lagrange’s interpolation formula. 

On the same assumption as has been made hitherto, namely that 
the function concerned is a polynomial in x, an interpolation 
formula can be evolved which is equivalent to the process of 
splitting up an algebraic fraction into its partial fractions. 

Let n values of the function y = be given, so that Uj, is sup¬ 
posed to be a polynomial of the (;z“i)th degree in x, and let the 
given values of x be <2, c, ...,7, k. 

Then we may write 

A {x — h) [x — c) ... {x — k) + B{x — a){x — c) ... (x — ^)-J-... 

-\-K{x — a){x — b),., {x—j), 

where there are n terms each of degree n—i in x. 

This is true for all values of x involved. Put therefore x — a. 

Then 2/,, — A {a — b){a — c) ... {a — k), 


• A ^ ^_ 

** {a-b){a-c) ,..{a-ky 

Similarly, by putting x — hy 

n ____ 

... {b~ky 

In like manner all the coefficients can be found. 

(x —(x —r) ... (x-k) ix — a) (x — c) ... (x — k) 

"x - “a i-y y :7^_r7^y+• • * 

(x — a) (x — b) (x — c) ... 
{k^a) 

or otherwise 

__^_^_L_ 

(x — a) (x — b) ... (x — k) (a — b) (a — c) ... (a — k)'x — a 

Ufy I 

{b^(b ^c)T. 7 (b - k) ■ 7 - 1 } ■'■•••• 

It is evident that this is exactly the same as splitting the fraction 

__ 

(x — a) (x-b) ... (x — k) 

into partial fractions. 


4-2 
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This alternative expression is due to Euler and was given earlier 
than Lagrange^s formula. 

It is interesting to note that Euler’s form, when written as 

Wa: „ 

{x — a){x — b) {a —x) {a —h) {b — x) [h — a) ... "^ * * * ^ 

is an expression for the divided difference A” It follows therefore 

abc ... 

that Euler’s formula (and consequently Lagrange’s) can be evolved 
from the divided difference formula by equating the ??th divided 
difference to zero. Also, since the expansion is symmetrical in x, a, Z>, c 
... ky the divided difference is independent of the order in which the 
arguments are taken, as stated in paragraph 3 above. 

12. Lagrange’s formula is usually laborious to apply in practice 
and requires close attention to sign; it is generally simpler to 
employ other finite difference methods. Where the intervals arc 
equal an advancing difference formula may be used, and for 
unequal intervals it is preferable to use divided differences. 

The principles on which this formula has been developed are 
the same as those assumed for the difference formulae, namely that 
n values of the function being given, nih differences are assumed 
zero. The following examples show the application of the formula: 
Example 3. 

Given the data in Ex. i, obtain by the use of Lagrange’s formula. 

__ «8 __^ ___ _ „ i 

(8- i)(8-2)(8-4)(8-7)(8-ia) (i-2)(i - 4)(i - 7)(i - 12^(8-1> 

, _ _"2_ 

(2-i) (2-4) (2-7) (2-12)‘(8-2) 

+_ “4 _+ 

(4-1) (4-2) (4-7) (4-12) (8-4) ’ 

Z/g _ 22 30 

7. 6747 ^. (-4) “1) (- 3H -'6) (- 11). 7 I 7 ^- 2) ( -Ik-" 70)7 6 

82 106 

'*’ 3 - 2 .(- 3 ) (-^)- 4 ’^ 6 . 5 . 3 .(- 5 )r 7 
206 

■'■11.10.8,5.(-4)’ 

A ■“ 10*666 ... +33*6 —95-666 ... -f* 158-293 ... +7-865 

= 93 (to the nearest integer) as in Ex. 2. 
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Example 4. 

Find the form of the function y — given that 

Wo = 8, «4 = 68, «5 = I23. 

By Lagrange’s formula: 

_ 8 III I 

x(;«-i)(jf-4) (A;-s)~(-i) (-4)'(“.I7)‘a:'^i ( -3) (-‘^'x-i 

68 I 123 I 

4 . 3 (-i)'x -4 s. 4 .i‘*-s 

2 I II I 68 I 123 I 

= . - . - +- -- 

5 JC 12 X —1 12 JC —4 20 ^ — 5 

__ I ii5.rH-4o I 57X —24 
20 a: (.V — 5) 12 (jc — I) {x — 4) 

_ 23^: +8 I9a: — 8 

= i + 8) (*- - 5 a; 4 - 4) - (i9»: - 8) («- - 5A;)] 

= -I- 3 ^: 4 - 8. 

It is useful to work out this example by divided differences, adopting 
two different orders for the values of x^ thus illustrating the principle 
that, if the same w’s are involved, the order is indifferent. 


(«) 

X 


Awx 

A“ux A*UjB 


0 

8 

3 ^ 1 = 3 



I 

11 


1644 = 4 




57-^3 = 19 

5-5 = 1 


4 

68 

55-^1 = 55 

3644 = 9 


5 

123 




Wx 

= 8 + 3^ + 4^ (a; — I) 4 I a: (vV — 

i) (^ —4) = —X243X48. 

w* 



Aw* 

Afujp A*Ma. 

5 


123 

-1x54 -5 = 23 


0 


8 


-84-1=8 



68 

4 60 444=15 

-44 -4 

4 



444 41=4 


- 57-^ -3 = i9 

I II 
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/. 1/^ = 123+23 (*-s) + 8(ar-5)A;+i (a;-s)»(*-4) 

= 123 +23*— 115 + 8a:“ — 40* + — 9^;“ + 20a; 

=:X^~X^ + 2X + S. 

Here, for example, 

A«i = A«i=I9, 

4 1 

AX= A®« 5 = I- 

1 , 4,5 0 , 4,1 

13 . The following examples are instructive: 

^ Example 6. 

Wo= ~i8, 2/i = o, u^-o, 1/5= -248, We = o, 7/9 = 13104. 

Find the form of assuming it to be a polynomial in x. 

Now since «i = o, 7/3 = 0, 7/g = o, the function must be of the form 
— i) (a; — 3) (x - 6) ^ (^), where ^ (:v) is a polynomial in x of the second 
degree. 

(x- i) (a:~3 ) (^--6)"”^ 


I.e. --"<L_ 

(-i)(- 3 )(- 6 ) 

= il>( 0 ) 

</> (0) = I, since ?/(> = 

~ 18 

^5 

4.2(-l) 

=^'(5) 

ti 

i-T 

II 

- 248 


= <^( 9 ) 

<^( 9 ) = 91. .. "» = 

I3IO4 

Whence, from the divided difference table, 


X 

^(*) 

A 4 (a:) AV( 3 e) 


0 

I 

6 


5 

31 

I 




IS 


9 

91 




fj) {x) = (f>{o)+xA<f>(x) + x(x- 5) (x) 

+6x + x {x — 

^x^-hx+i; 

7/^. = (:v - i) (a: ~ 3) (a: - 6) 4. .v + i) 
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Example 6. 

Given «6 = 23, 1/11 = 899, W 27 = i 73 i 5 » "34 = 35^>o6, 1/42 = 68510, con¬ 
struct a table of divided differences and extend the table to include 
arguments x = 2 repeated as many times as may be necessary to find 
in powers of (x — 3). 

From the data we have 


X 


Awa; 



5 

23 

146 



II 

899 

1026 

40 

I 

27 

I73I5 

2613 

69 

I 

34 

35606 

100 




4113 


I 

42 

68510 

b 

a 

I 

3 

c 

e 

d 

I 




f 

1 


a — 

100 — 24 X I = 

76 




4113-31 x// = 

1757 



c =68510 — 39 y.b — 

- 13 




(/ —31x1 = 

45 



e — 

b —39 X d — 

2 



/ = 

d -39x1 = 

6 



/. u^=^c + (x-2,) e + {x-2)^f+{x-2fxi 

= -13 +2 (A;~3)-f 6 {x-^)--h(x-'^f. 


14 . Newton's formula with divided differences may be considered 
as the basic formula in finite differences. It has been shown that, 
by making the intervals equal, the ordinary advancing difference 
formula follows, and that Lagrange's formula can be evolved from 
the divided difference formula by equating the nth divided dif¬ 
ference to zero. Moreover, by taking the limiting values when the 
intervals tepd to zero, Taylor’s theorem can be obtained. 

The formula is of the utmost importance analytically and 
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historically, and the advanced student may be recommended to 
read Mr D. C. Fraser’s “Newton’s Interpolation Formulas” 
(J.LA, vol. LI, pp. 77-106 and pp. 211-32, and vol. lviii, pp. 53-95) 
and the same author’s “Newton and Interpolation” (an article 
in Newton^ 1727-1927, a memorial volume published by the 
Mathematical Association). 

"^15. Adjusted differences. 

There is a system of differences which may be considered as 
the connecting link between ordinary differences and divided 
differences. These differences, which are called adjusted differences^ 
were used by Newton and re-discovered by Sheppard. 

When the successive arguments are a^b.c the relation between 
adjusted differences and ordinary differences is as follows: 


Order of 
difi'erences 

Divisor of difference 
for divided 
differences 

Divisor of difference 
for adjusted 
differences 

First 

a — b 

a — h 

Second 

a — c 

l{a-c) 

Third 

a — d 

i (a-ii) 

Fourth 

a — e 

i (a-f) 


Ordinary differences are adjusted differences and both sets can 
be used in the same scheme. 

An interesting account of the Newton-Sheppard system of adjusted 
differences will be found in J.LA. vol. lviii, pp. 60-74 (^* Fraser). 

♦16. An expression for the nth divided difference. 

The results of paragraph 5 may be written in the form 

where Rn+i vanishes for n+i values of x, say a^b.c L Hence, 
by repeated application of Rolle’s Theorem (Part I, Chap. IV, 
p. 62), it follows that, in the interval which includes these values, 
the first differential coefficient of R with respect to x vanishes at 
least n times, the second differential coefficient vanishes at least 
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w — I times, and finally, the nth differential coefficient vanishes at 
least once, say where x = 



Now is of the form P,, {x)y and since the wth differential 


coefficient of P,j {x) with respect to jc is «! times the coefficient of 
it follows that 

U^ = n\ 


dn 


be ...l 


or 



! ^ 

be... 




for some value of x (^) within the range which includes a, b, c ... I, 
and X . 


’*'17. Remainder term in the divided difference formula. 

If in the proof above we bring in another term we have 


dn+l 

where , u 




bel... X 

is written for 



This is the divided difference involved in R„+i (x). 


^ 1 (a:) = - a) (a: - - /) J +1)!. 

When the intervals are all unity, this becomes 

dn n 

41 (•^) ~ ^(n+l) l ’ 

where | is some value in the interval including all the arguments 
involved, i.e. the given arguments a, b ... I and also x. 
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EXAMPLES 3 

I. Given terms at unequal intervals, explain how to apply the method 
of divided differences to find an interpolated value: illustrate your 
answer by finding ii^ given 

%6 o = 1345 . “4-55=1470, 1/4.70 = 2010, 1/4.00 = 3815, «6-I6 = 10965. 

2 - “40=43833, “42=46568, 1/44 = 49431, 1/45 = 50912. Use divided 
differences to find W43. 

3. Given the following table, find log 656: 

No. 654 658 659 661 

Log 2-8156 2*8182 2*8189 2*8202 

4. i/5„= 1-6990, 1/62 = 17160, 1/64=1-7324, 1/55=1-7404. Find 1/53 by 
divided differences. 


5- “35-0 = 1175, “35-5=1280, 1/39.5 = 2180, 1/49.6 = 2420. Obtain a 

value for (i) by advancing differences, (ii) by divided differences. 

6. M7-0 = 235, = 256, = 436, 1/8-1 = 484. Findt/g-o- 

7. Find the first three divided differences of the function y = a:-- for 
the arguments x — l, //, p. 

Find by Lagrange's formula the value of 

8. z/48 given tt4o = 15-22, 1/50=12-62, Wr5 = ii-i3. 

9. given//„ =17*378, 2/5 = 15*894,//ic= 14*270, w,5= 12-412. 

10. M22 gi’^on//io = 22-40, Wj5 = 2i*66, 1/20 = 20*82, 1/25 = 19*85. 

11. z/j given = *400, z/g = *128, z/g = *224, z/4 = *376. 

12. Use Lagrange’s formula to find the form of the function y=f (a:) 
given 

a; o 2 3 6 

f{x) 659 705 729 804 

13. Values of u,. are given for all integral values of x from o to « — i. 
Show that is capable of expression in the form 

__ 

(*-«)!(«-!)! [a:-K+I ^ 


+ (“-l)( 2 ) 


x-n-h2 

Un 


'♦n -3 




Find 1/4 given 1/0 = 4, “i = 7 > “2=12, 1/3 = 30, by using the above 
formula. 
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14. By means of Lagrange’s formula, prove that, approximately, 

(1) = M3 ~ *3 (ws ~ W-3) + *2 (m _3 - m_ 5 ), 

(2) Mo = i [Ml + M_i] -i[l (W3 - //j) - I (w_i ~ M_3)]. 


15. Four equidistant values w_i, Wq, Wj, and Mg being given, a value 
is interpolated by Lagrange’s formula. Show that it may be written 
in the form 


where x +y = i. 


y (v^ “ i) . o a: ( x - - i) . „ 
=yuQ + xu^ -J 


16. If /(«i, «o) 


etc. 

Ml ~ Mq ^2 — «! 


be 


divided differences of the first order;/( mo, Mj, ^ 

^2 ~ ^0 

etc. divided differences of the second order and so on, find/ (2, 4, 9, 10), 
where / (:r) = (i) x^ — zx^ (i i) x"* + 4* i. 


17. Prove that if he a polynomial of the «th degree in x^ and if 
values M^, Wj,, «<,, ... of be given, then the expressions for in terms 
of its divided differences are identically equal whatever the order of 
arrangement of the u's. 


18. Apply Lagrange’s formula to find / (5) and /(6), given that 

fix) = 2, /(2)=4, /(3) = 8, /(4) = i6 and fi 7 ) = xzS- 

and explain why the results differ from those obtained by completing 
the series of powers of 2. 

19- W- 3 o = 3 o; «-i 3 = 34 ; W 3 = 38; Mi 8 = 42. Find Wo- 

20. u^Q — y- 6 g; M72 = 7*o7; u>j^ — 6 'yS; W75 = 6*i8. Interpolate to find 
M71 by divided differences,using the following orders of the argument: 

(i) 70, 73 » 7 S» 72; (ii) 72, 75 » 7 o» 73 - 

21. By means of divided differences, find the value of Miq from the 
following table: 

X II 17 21 23 31 

14,646 83,526 194^486 279,846 923^526 

22. Prove that ^ +y + ^ and that 

yz 




FINITE DIFFERENCES 


6 o 

23. If the data are Wq, Wg, W4, ^7, Mu, and the interpolation formula is 

~ ^^4 “1" A ^4 "h ^2 A“^3 "h Cg Mq + C4 /t\^ Mq , 

3 4,7 3,4,7 3.4,7,11 

find the values of Q, Q, Cg and Q. 

24. If the data are as in Qu. 23 above, and the formula is 

= + At^^(x-3) (^- 4 ) A"w + (^~3)(^“-4)^A^« 

+ (‘^-”3)(^-'4)^(‘^~7) A^^^ 

find the missing suffixes of the operators and the subscripts of the i/’s. 
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CENTRAL DIFFERENCES 

1. If a series of values of Uj, be given, we can interpolate to find 

any intermediate value by one of the methods in the preceding 
chapters. Where the values of the argument x proceed by unit 
intervals it has been shown that, on certain assumptions, Newton’s 
advancing difference formula can be applied to give satisfactory 
results. If the value of were required for some value of x be¬ 
tween x = o and I, it might be considered that we should obtain 
a better result if our knowledge of the shape of the curve extended 
on both sides of the values of x between which w^e wish to inter¬ 
polate. I'hat is to say, where wx" may choose any values of at 
unit intervals for our data, it might be of advantage if wc could use 
a formula involving values such as z/_3, u^y Wy, ... 

rather than Wq, u^y W3, .... By the advancing difference 

formula wx expand u^. in terms of a given value of and its leading 
differences and, by giving a; a suitable value, such a formula can 
be made to embrace any values of u that we wish. It is however 
convenient to use special formulae called central difference fonnulae, 
based on differences obtained from the values of u^, on either side 
of the origin; this is found to result in smaller coefficients and a 
more rapidly converging series of terms. 

2. There are various central difference formulae that are of use 
in actual practice, and the development of the better-known 
formulae is an exercise in the application of the fundamental 
principles of finite differences which w^ill be advantageous to the 
student. These formulae apply whether the values of the function 
correspond to equidistant values of the argument or to values with 
unequal intervals. We shall consider first the general case and pass 
from it to the simpler and more usual case of equal intervals. 
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3. Gauss’s formulae. 

Suppose that we are given the values ... 7/^... and 

that we wish to deduce a formula for in terms of the even 
differences which fall on the line of and the odd differences 
which fall on the line between and as shown by the lower of 
the two series of dotted lines in the following difference table: 


! 

til 

la 

,A®«„ 

a ^ ^ lab 

ab lobe 

b ahe 

Ub AX 

bo 

AK* 

c 

tie 

The formula will thus involve in succession 

A«a. A-M;, A®M/, AX„ .... 

b abt abc labc 

As we pass from each of these to the next, the new arguments 
brought in, one at a time, are b, /, r, m, .... Hence, applying 
rule (iii) of Chap. Ill, paragraph 7, namely, that each new argu¬ 
ment gives the new factor for use in the next term, and adopting 
the abbreviated notation x — a = A, x — b = B etc., we can at once 
write down the required formula. 

The formula will be 

«X = “(I + ^ A Ma++ ABLA^u,+ABCL + ..., 

b ab abc labc 

where the law of formation is evident. 

Now let the intervals between the arguments be all equal to 
unity and let t2 = o, so that c = 2, /=—i, ni^—2, 
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Then, as in Chap. Ill, paragraph 10, we have 

Alia =AMn; 
b 

= A"m_j/2 ! j 
ab 

=A»m_i/3!; 

abc 

lahc 

Further, A=x~a = x, 

B=X~h = X— ly 

L — x —I = a: -f I, 


The formula for equal intervals is thus 

. i) (jc—i) 

= --—p- 




or 


Uq-{-X(i) AUq + X(2)^^U__^ + {x-\-i) (3) + (^ + I)(4) A^H-.... 

This is known as Gauss’s “forward” formula for equal intervals. 


4. If we take the even ditterences as above but the odd differences 
which fall on the line between Ui and w„ (the upper of the two series 
of dotted lines in the table in the preceding paragraph) we obtain a 
formula which involves in succession 

«a, AM;, A-M„ A®M„„ AX. •••• 

a ab lab labc 

The new arguments brought in are /, />, m, Cy ... and, using the 
rule as before, we evolve the form 

— + AL/^Ui 4 - ABL/^u,^ 4 - ABLM/^u„^ -h.,.. 

a ab lab labc 
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From this we may pass at once to the formula for equal intervals of 
unity: 

+ + (x +1) ( 2 ) + I) (3) A3w_2 

+ Cx: + 2 )( 4 )A 4 w_. 2 +.... 

This is Gauss’s ‘‘backward” formula. 

Note. It is useful to remember that, after as we advance from 
each term to the next, in the forward form wc alternately deduct unity 
from the subscript of u and add unity to the number whose factorial 
is the coefficient; while in the backward form we alternately add unity 
to the number whose factorial is the coefficient and deduct unity from 
the subscript of u. 


5. Stirling’s formula. 

Taking the mean of the two Gauss formulae we arrive at the 
following expansion: 


"x ^"o + x.l (AW(, f Aw_,) + ■ A-)' J 


+ 




.HA«w_i + A«//,„) F 


^>2 ^ ^.2 _ j 2^ 


AhoH" •••» 


in which we have alternately mean cocjficients and mean differences^ 
falling on the line through in the difference table. 

This is known as Stirling’s formula. In using it x should lie in 
the range — I to -f i.e. \ on each side of the line of Wq. 


6. Bessel’s formula. 

Transferring the origin in the Gauss backward formula from 
o to I, we have 


. ^ , x(x—i) X (x~~ j) (x~ 2) 

«X = Ui + (x-i) Ai/o + - -A-m,, + --/- 


{x 4 -1)-v (x—1) (jc —2) .. 

+ y -^ .V \- ' +... 

4! 


The mean of this and the forward formula is 

“x= l («0 + «I)^ (■»-!) 

+? <* f) J (A',c,+AX)+"1'- AX, 

2 . 3 ! 

(x-d-1)(x?—i) (a? —2) , , . . .. . 

, ^ I (A4w„34-A^t/_2) + ..., 
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which is Bessel’s formula. In using the formula x should lie in 
the range o to i, i.e. | on each side of the central line. 

In this formula we have alternately mean m’s or differences, fall¬ 
ing on the central line between Uq and and mean coefficients. 

If we write y + l for x in Bessel’s formula, we obtain the simpler 
and more symmetric form: 

«X = Wj,+J = I («o + «i) + jAmo + J * I (A2m_i + AX) 


+ 




AX, + I (AX. +AX,) + ..., 


•the most convenient shape of the formula for practical purposes. 


7. An alternative method of obtaining Gauss’s forward formula 
for equal intervals is as follows: 

The ordinary advancing difference expansion is 

4 - a:(i) Awo + -f x-o) AHq -f x^^ A\ -f .... 

Now = A^w_i 4 A^w„3 ; 

A^'^z/q = A^w_.i + A%,_i; A^z/... j == A^w_2 + A^w_2 > 


Therefore we may write 

= Uq + ^(1) Auq + X( 2 ) + A^w^i) 

4- x^ 2 ) (A^m_i 4 - A^m_i) 4 x^^) (A^z/_i -f A®w„i) 4-... 
= Wo 4 - a:(i) Awo 4- x^ 2 ) + (^(2) + %)) 

+ (%) + %)) A4 w_i 4-... 
== Wo 4- ^(1) Awo + ^-(2) A^w^i 4 (a: 4 I )(3) A^w_i 

4 (a: 4 I )(4) (A^w_2 4 A'^w_2) 4 ... > 
since X(^j.) 4 •x^(r+i) — (a’ 4" i)(r+])» 

and so on. 


A proof similar to the above, in which the general term is evolved, 
and which depends upon the method of separation of symbols, will be 
found in y,LA. vol. l, pp. 31, 32. 
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The backward form may be obtained similarly by grouping the 
terms in a slightly different way and using the relations 

Az/q = Ai/„i + A-m_.j , 

-f A‘^w_i, 

= A®w_ 2 + 2 > 

etc. 

8. Everett’s formula. 

The Gauss forward formula with interval x and initial term 74 
may be written 

+ (x+2 )(5 )A^W..j + .... 

The backward formula with interval (jc—i) and initial term 
gives 

v^ = Vi + {z-i )a)AVo + X(2,A2t;o + a:( 3) A'V.I + (a;+ i)(4, 

+ (a;+ i)(5) A''’rL2+.... 

Subtract the second scries from the first: then, since 

^x+i-^’x = Ar;^, 

we have 

Av^ = *(i) At’i + (a; +1 )(3) A*r;o + (* + 2 )( 5 ) A57;_i +... 

- -1) (1) At’o - ^(3) - (ac +1) ( 5 , A'‘:'_2 -... . 

Put «4,, Au^, AX. ••• for Aw^, AX. AX. 

Then 


«X=aC(i) «I + (ac +1 )(3) A\ + (af + 2 )(5) A%. 1 +.. . 

-(ac- i)(i)Mo-a;(3)AXi-(ac+ i)(5)A'*m.. 2-.(i) 

When X is less than unity a convenient form of this formula for 
interpolation between Mq and Mj is obtained by putting ^=i—x; 
thus 

+K+A’"-.+A%_,+..., 

the most common form of Everett’s formula. 
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The above elegant proof is due to G. J. Lidstone {J.I.A. vol. lx, 
pp. 349-52). In his note on this formula Mr Lidstone shows how to 
obtain another formula similar to the above for interpolation between 
w_i and u^. 

If we difference formula (i) bearing in mind that 
A = + 

we have Az/,. = -f +1 )( 2 ) A-z/o + 4 - 2)(4) A^//_i 4 -... 

—- Wq — ^^(2) A“W_2 — (x4- i)(4) A'*W. 2 — . • 


or, writing for Awa., A^a; for \^Uj. and so on, 


= Mo + (^ + l)(2) Az/o 4 - 4 - 2)(4) A=^Z/„J 4 - . .. 

— — (x 4 -1)(4) A^u_^ - .... 

The formula then becomes 




where ^ 4- ^ and q = ^—x. 

This form is generally known as Everett's “second'' formula. Both 
formulae, perhaps more particularly the first, are specially adapted for 
use in statistical work. They can be applied in the case of unequal 
intervals: see Todhunter, y./..d. vol. l, p. 137, and Lidstone, Proc, Edin, 
Math, Soc, Series i, vol. XL, pp. 25-6. 


9. Everett's formulae can also be obtained from Gauss's formulae 
by a simple transformation depending upon the properties of the 
binomial coefficients. 

Any pair of terms of the form 

^(r) + (3^ + I) (^+1 ) 

= (j+ l){r+l) (A''«, + - {(l)(, , 1 , 

= (>”+0(r+l)^’'«^+l-J(r+l) 
since {y + -y^^) =7(r+i)- 

Taking the pairs 

i~2; 3-4; 5“6 ... in Gauss’s forward formula, 
and 2-3; 4-5; 6-7 ... in Gauss’s backward formula, 
the two forms of Everett's formula are at once obtained. 


5-2 




68 


FINITE DIFFERENCES 


*10. It is of interest to show in a simple manner the relations be¬ 
tween Everett's and Bessel's formulae. The formulae are closely 
related, and the following demonstration indicates clearly the con¬ 
nection between corresponding coefficients in the two expansions. 

In Bessel's formula let ftgn denote the coefficient of the “mean" 
difference S 

and ^2n-fi coefficient of the difference 

Then, since , 

the sum of the terms 

K ■ 2 + A2«m_„+i) + b^,,^ 1 A2”+1h_„ 

may be written as 

^2,, A2"M_„ +(J 62 „ +Ai!„Hi) A2«+1m_„ .(fl) 

Consider the expression 

e2„A‘‘*«M_„+€2„A2»M_„^l. 

I'his is the same as 

(«2n + «2r,) A2»+1 m_„. . {b) 

Again, by reference to Bessel’s formula, it will be seen that 

L L / \ 

. 

Equating (a) and (Z^), and introducing the relation (c), we have 
^2n "b ^2n ~ ^2n > 

and €2,, = ^^271 d" ^2n+l 


whence 


^2n ~ \^2n ^2n+l 

= ^2»{i-(^-i)/(2« + l)} 


from which the coefficients already given in Everett’s formula can 
at once be deduced. 






ILLUSTRATIVE EXAMPLE 


69 


The same analysis applies to each pair of terms. 

It should be noted that 

(i) €j. and e^. are the same functions of x and i—x respectively; 

[In fact, «2« = (-* + w)(2«+i), 

«2«= - (•^ + «-l)(2,Ml) 

= (l--*’ + «)(2«tl)] 

(ii) the pair of terms in Everett’s formula is exactly equivalent 
to the corresponding pair in Bessel’s formula. In Everett’s formula, 
therefore, precise allowance is made for the next odd difference, 
although this difference does not appear explicitly. 

Similarly, by taking the pair of terms 

and 

in Stirling’s formula, Everett’s second formula may be obtained. 

11 . It will be instructive to compare the results brought out by 
applying first. Gauss’s fonvard formula, and, secondly, the ordinary 
advancing difference formula, to the same set of data. 

Example 1. 

Interpolate by means of Gauss’s forward formula to find the present 
value of an annuity of i p.a. for 27 years at 5 per cent, compound 
interest, given the following table: 

No. of years 15 20 25 30 35 40 

Annuity-value 10*3797 12*4622 14*0939 15*3725 16*3742 17*1591 

If we take 25 years as the origin and 5 years as the unit, the value 


required will be w.4. 





X 


Auj. 

A=»x 



— 2 

10*3797 

2*0825 




— I 

12*4622 

1-6317 

- -4508 

•0977 


0 

14*0939 


-•3531 


I 

6 



1*2786 


•0762 

•0054 

I 

^ 5'3725 

1*0017 

- -2769 

•0601 

— *0161 

2 

16-3742 

— *2168 





•7849 




3 

17-1591 
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The Gauss forward formula is 

Uj, = Uq + AWo + ^( 2 ) A 2 m_.j +(x-\-1 )( 3 ) A^U_^ + (‘^ + I)( 4 ) 

4-(^-f 2 )( 5 ) A^*W_ 2 * 

When X = *4 the successive coefficients are 

•4; —-12; --056; -0224; *010752 

and to four decimal places the value of w .4 is 14*6430, which agrees with 
the tabulated value. 

To apply the advancing difference formula we take 15 years as the 
origin and are required to find when :v = 2'4. 

In the formula 

u,, = Uq 4- x^^^ Ai/o + a:(2) A'\ + a:( 3) A% -f x^^) A\ + A^ 

the coefficients are 

2*4; 1*68; *224; -*0336; -*010752. 

On evaluating the expansion we obtain for W2.4 the value 14*6430 as 
above. 

It will be seen that the two results are in agreement (as indeed 
they must be, since the same values of u are used), and it may be 
asked therefore wffierein lies the advantage of using the central 
difference formula. This question will be discussed in the next 
paragraph. 

12 . Consider an approximation to a particular value of based on, 
say, r values out of n available. The error in the approximation, 
as measured approximately by the first neglected term, is least 
when the coefficient of that term is least. It can be shown that this 
happens when the values of upon which the interpolation is 
based range round the space in which x falls, so that x is as nearly 
as possible central. The central difference formulae give a systematic 
method for building up the table subject to these conditions. 

Again, the central difference coefficients are as a general rule 
smaller and more rapidly convergent than those required for the 
calculations in the advancing difference formula (as will be seen 
in the Example) and, by a suitable choice of origin, the arithmetical 
w'ork may be reduced to a minimum. 

It should be noted that, in the phrase “as measured approximately 
by the first neglected term’’, this measure is not theoretically complete; 
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it is however generally sufficient in practice if the first neglected order 
of differences is constant or is changing but slowly. When this is not 
so it will not necessarily be true that a central difference formula 
beginning with Uq is more accurate than the advancing difference for¬ 
mula beginning with the same term. [See p. 337 of Sheppard’s paper, 
cited on p. 72.] 

We may even make the result worse by introducing differences of 
a higher order (cf. Chap. V, paragraph 15). In general, however, these 
anomalous cases arise only rarely. 

13. It should be noted that the greater accuracy of central dif¬ 
ference formulae as compared with the advancing difference form 
is not due to the formulae but to placing x in the central interval of 
the range of the given values. Provided that this is done—which is 
very important—it is immaterial whether we use a central dif¬ 
ference formula or the advancing difference formula with the same 
w’s. A disadvantage of the latter form is that the coefficients are 
larger and are not tabulated as, in practice, are those of the 
principal central difference formulae. 

14, Relative accuracy of the formulae. 

The relative accuracy of the various central difference formulae 
can be investigated in an elementary manner on the following lines. 

The Gauss forward formula is 

Wj. = -f ) Awq -f Xio) + (x + 1 ) (3) + (x + I) (4) 2 + .. ■ . 

If we expand by Stirling’s fonnula as far as a certain order 
of even differences we shall obtain by a simple transformation the 
above formula to even differences, for the same w’s are involved in 
both. Similarly, Bessel’s formula and Gauss’s formula are identical 
to odd differences. Now the Gauss formula involves only ordinary 
differences while the other two series involve mean differences of 
the form ^ (A’^z/,. +A^i/,.^1). If instead of proceeding to constant 
differences the series stop short at, say, rth differences—^which are 
not constant—the use of any of the formulae will involve an error. 
It remains to examine which of these formulae gives the best result 
in different circumstances. 
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The following demonstration is based on that given in greater 
detail by Mr D. C. Fraser in JJ.A. vol. L, p. 25: 

Suppose that x is not greater than -5. Then, by calculating the 
coefficients in Gauss’s formula, it will be found that for positive 
values of x none of the coefficients (except that multiplying Awq) 
differs greatly from ± -5 times the preceding coefficient. (See 
Table, Jf.l.A. vol. l, p. 25.) Thus the terms after that involving 
the third difference are approximately equal to 

(.V+ l)(4) (A^+ iA^) 

i.e. to (jc-f i)(4) i (A%_2-h A'^// _i). 

If therefore we substitute I (A%_2 + ^^^-i)j the mean difference in 
line with for A^w„2 in Gauss’s formula, we make the fomiula 
very nearly correct to fifth differences, without having to calculate 
the actual coefficient of the fifth difference. The substitution there¬ 
fore greatly improves the accuracy of the formula. 

When, however, the substitution is made, it will be found to 
reproduce Bessel’s formula to the fourth mean difference. There¬ 
fore Bessel’s formula to fourth mean differences is usually more 
accurate than Stirling’s to the fourth difference. 

It may be shown similarly that Stirling’s formula to odd mean 
differences is usually more accurate than Bessel’s to the same order 
of differences. 

The above demonstration is only approximate: a strict investigation 
into the relative accuracy of central and advancing difference formulae 
requires rather more elaborate mathematical discussion. (See Whit¬ 
taker and Robinson, Calculus of Observations^ p. 49; Lidstone, T.F.A. 
vol. IX, pp. 246-57; Fraser, JJ.A. vol. L, pp. 25-7; Sheppard, 
Proceedings of the London Mathematical Society^ Series 2, vol. iv, pp. 
320-41 and vol. x, pp. 139-72.) 

Mr D. C, Fraser has given the following criteria summarizing 
the properties of interpolation formulae: 

(i) Formulae which proceed to constant differences are exact 
and are true for all values of n whether integral or fractional. 

(ii) Formulae which stop short of constant differences are 
approximations. 
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(iii) Formulae which terminate with the same difference are 
identically equal, for they involve the same m’s. 

It should be noted that these rules are quite general; they apply 
to all formulae based on finite differences, not ending with mean 
differences. 

15 . Apart from the general superiority of central difference 
formulae certain of the formulae possess distinct advantages in 
special circumstances. For example, for the bisection of an 
interval Bessel’s form is convenient, since the alternate terms are 
zero. 'We have, at once, 

Again, in using Everett’s formula for the subdivision of inter¬ 
vals the terms are such that they may be used twice: they occur 
both in an expansion and in reverse order in the next 
expansion. An example will make this clear. 

Example 2. 

Given x 30 35 40 45 50 55 60 

z/j. 771 862 1001 1224 1572 2123 2983 

obtain values for for all integral values of x between x — ^o and 
X — 30* 

The difference table is 


X 

W, 





30 

771 

91 




35 

862 

139 

48 

36 


40 

1001 


84 

41 

5 

45 

1224 

348 

12s 

78 

37 

50 

1572 

551 

203 

106 

28 

55 

2123 

860 

309 



60 

2983 
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Everett’s formula gives 

= •»(!) «1 + (^ + l)(3) + (* + 2)(6) A*K_i + ... 

+ ^(l)“o + (f + '')(3) ^^“-1 + (^ + 2)(S) A%_2 + ..., 
aino !Vj,.j = ^,i)«2 + (^ + i),3) A'^Uj + (^ + a),^, A\ + ... 

+ «( 1 ) «1 + (* + I )( 3 ) AX + (* + 2 )(S) ^*“-1 + • ■ •. 

and the second line in Uj, ^ is the same as the first line in ?/ ,. 

Since the data are given at quinquennial points and we require 
values at individual points, we may write x = ‘2, *4, *6, ... and 
i = '8, -6, *4, .... The first line of u.2 will be the same as the second line 
for Wj.p, but in the reverse order, and so on. 

The coefficients of the terms in the first line of the formula for 7/^. are, 
to fourth differences, 

•2 “'032 -006336 

‘4 -*056 *010752 

•6 “*064 '011648 

•8 ~ *048 *008064 

The work is best arranged in tabular form, thus: 






: Sum of 



X 

XUi 

- 


first three 
terms 

Sum of 
second 

Inter¬ 

polated 



.1! 

51 

(ii) + (iii) 

three 

result 





-kiv) ' 

terms 

(v) + (vj) 

(i) 

(ii) 

(iii) 

(iv) 

(v) 

(vi) 

(vii) 

•2 

200*2 

- 2*7 

0*0 

197*5 



•4 

400*4 

- 47 

0*1 

do 



•6 

600*6 

- .S*4 

0*1 

595-3 



•8 

8oo*8 

- 4*0 

0-0 

796*8 



*2 

244-8 

- 4*0 

0*2 

241*0 

796-8 

CO 

T' 

0 

•4 

489*6 

~ 7*0 ; 

0*4 

483-0 

595-3 

1078*3 

*6 

734*4 

- 8*0 

0*4 ! 

726*8 

395-8 

1122*6 

•8 

979*2 

- 6*0 

0*3 

973-5 

197-5 

1171*0 

•2 

! 3H*4 1 

- 6*5 

0*2 

308-1 

973-5 

1281*6 

* 

628*8 j 

-11*4 

0*3 

617-7 

726*8 

J 344*5 

' *6 1 

943*2 1 

-13*0 

0*3 

930'5 i 

483-0 

1413*5 

•8 

1257-6 ; 

“ 9*7 

0*2 

1248*1 

241*0 

1489*1 


The only column which needs explanation is column (vi). This 
column represents the second set of three terms of the formula, correct 
to fourth central differences, and is obtained by writing down in the 










ILLUSTRATIVE EXAMPLE 75 

reverse order the values of the previous column applicable to the sum 
of the first three terms. 

It should be mentioned that the values in column (vii) of the table 
do not quite agree with the tabular values: the tabular values are 1038, 
1081, 1122, 1172, 1281, i345» 1415, 1490. The reason for this is that 
the function upon which the original values depend is not a rational 
integral function of the independent variable and that therefore a 
formula based on finite differences is only an approximate representa¬ 
tion of the function. The example is based on the rates of mortality 
according to the Table, the data being 10^ times the probability of 
dying in the year of age x. 


16 . The following example is instructive: 


Example 3. 

Use an appropriate formula to obtain successive approximations to 
^ 28-3 given = W 27 = *037037> ^<>8 = *035714, -034483, 

W30 = ' 033333 - 

It should be noted that, where the data are extensive and it is 
required to obtain successive approximations to a result by the use of 
some or all of the data, it is more advantageous to use a central difference 
formula than advancing differences. 

From the data, we obtain 


X 




io®A=’t/a. 


26 

38462 

-1425 




27 

37037 

-F323 

102 

— 10 


28 

35714 

-1231 

92 

— II 

— I 

29 

34483 

--1150 

81 



30 

33333 






In its simple form, Stirling’s formula is 
= Wo + i ('^“0 + + 2 ! '^^-1 






■(.v2-i2) 


4! 


A%_2. 


Taking the origin at 28 and letting x — -} we have, as a first approxi¬ 
mation, 


“•3 = 3S7I4 + -3 [i {-1323-1231)] =35331. 
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For the next approximation all that we need do is to add 

•09 

'2 92 = 4 * H- 

^^*3-35335*14 (say 35335 )- 
Similarly, the next term is -21) 


so that 


= •478... 

^'•3-35335*^>2... (say 35336). 


The addition of the next term will not affect the last figure. 
The required successive approximations are therefore 

•03533-035335. *03533^. •o 3533 ^>' 


*17. Sheppard’s central difference notation. 

Just as A== 7 ?—I, or = —similar symbolic identities 

may be deduced from the relations existing between and 

Dr Sheppard has introduced the following notation, which is 
widely used by mathematicians: 


Bu^~u 


'x+l * 




and 




In this notation the difference tabic takes the following form: 


X 




_ 




— I 


S«_J 

8V 

0 


8\ 

8M4 

I 


S-t/j 










8^Ui 


S^Un 


It is specially to be noted that the subscript corresponds to the 
argument which falls on the same line as the difference. 
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The relationships between E, 8 and fi are quite easy to establish. 
8== £-i [£_ i] = A. 

Also 

and I (£-£“!)= 1 (A/^l + A), 

/xS2^+^ A2^ 

= I (AE-14- A) -1 A‘^^^+1 (£-1 + i) £-«. 

Again 2/>t = zE^ — 8, 
or E^=ix-\-^h» 

By means of these symbols the central difference formulae can 
be written down in very convenient form. 

For example, Gauss’s forward formula is 

= «o + *^( 1 )+ ^( 2 )+ (‘^ + I)( 3 ) 4 (a> + I)(4 ) 8X + • • •, 

and Stirling’s becomes 

= ^0 + + {-^ + 1 )( 3 ) + • • • • 


EXAMPLES 4 

1. Apply a central difference formula to obtain i/gs, given «2o=H» 
*^24 = 32> ^28 = 35» W32 —4®* 

2. Given Wj = 10, Wj = 8, = 5, tt„i = 10, find by Gauss’s forward 

formula. 

3. Use Stirling’s formula to find Wgsj given 1/20 = 49225, %, = 48316, 
1/30 = 47236, tt36 = 45926, W4o = 44306. 

4. Given 1/20 = 2854, 2/24 = 3162, W28 = 3544 » “32 = 3992; find by 
Bessel’s formula. 

5. /;r2i = 18*4708; //26= 17*8144; “29=17*1070; “33=16*3432; 

flg, = 15-5154. Find //go by Gauss’s forward formula. 

6. What are the practical advantages arising from the use of central 
differences in interpolation? 
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Employ Stirling’s formula to obtain successive approximations to 
(1*02125)^®, given 

(I-01)50 =1-64463; (i.o2)5o = 2-69I59; (1-03)50 = 4-38391; 
(i-oi5)'o = 2-10524; (1-025)50 = 3-43711. 

7. Find formulae true to third differences for the bisection of an 
interval 

(i) in terms of the two nearest values of the function and of 
differences of the functions; 

(ii) in terms of values of the function only. 

Apply either formula to find P35, given the values of at 20, 30, 40, 
50 to be 1313, 1727, 2392, 3493 respectively. 

8. Given the table 

310 320 330 340 350 360 

log a: 2*4914 2*5052 2*5185 2*5315 2*5441 2*5563 

find the value of log 3375 by a central difference formula. 

9. Prove that if third differences are assumed to be constant 

X (x^- i) , y i) , 

Uj, = xUj^+ -—^ A**«y4->’Wo+*=^^- 

where y — i—x. 

Apply this formula to find the values of to and z/jr to W39, given 
that Wo = 3oio, W5 = 27 io, ^10 = 2285, 2/15=1860, «26=i5^o 

and 2/30= 1835. 

10. From the table of annual net premiums given below find the 
annual net premium at age 25 by means of Bessel’s formula; 

Age Annual net premiums 

20 *01427 

24 *01581 

28 *01772 

32 -01996 

11. Apply a central difference formula to find / (32), given 

/(25) = -2707, / (30) = -3027, /(3S) = -3386, /(40) = -3794. 

12. Use Gauss’s interpolation formula to obtain the value of / (41), 
given /(3o) = 3678-2, /(3S) = 2995‘i. / (4°) = 2400-1. /(45) = 1876-2, 
/(so) = 1416-3. 

Verify your result by using Lagrange’s formula over the same figures. 
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13. Prove the following formulae for the general case of unequal 
intervals: 

(i) M» = Ma + ^ i(A«J+AMa) + M (-3 + -^') 

+ ABL J (A"«m + A“«/) +... (Stirling) 

(n) = \ {Ua + Ui) + \ {A + B) + AB i {/K% + A*«a) 

+ I {ABC + ABL) A?M(+ ... (Bessel) 

and show how to obtain a general Everett form. 

14. Show that any central difference formula can be developed 
from Lagrange. Apply a central difference formula obtained thus to 
find /(3I), given that /(a) = 2-626; 7(3) =3-454; 7(4) = 4784 and 
/( 5 ) = 6-986. 

15. Given u„, u^, u„, 1/3, M4, Mg (fifth differences constant), prove that 

where a = Wq 4- ; b — ii^ + u^; c — + w.j. 


16. A series is formed by the division of the terms of the two series 

I 2 6 24 ... w ! 

4 20 120 840 ...(«-f 3)1 

Obtain an interpolated value for W2jA’2i series by a central 

difference formula and compare the result with the quotient of by 
in the component series. 


17. The following is a difference table written down in Woolhouse's 
notation: 


«2 


^-a 






b-i 

bo 

b. 




do 


If i (^-1 “f ^1) — \ show that Stirling's formula 

(to fourth differences) can be expressed as 


War = Wq + + Bx^ 4- CX^ 4- Dx^, 


where B, C, D are functions of Cq, ^4 only. 
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18. Prove that in Woolhouse’s notation 

= «0 + ^(1) + ^(2) ^0 + (^^ + I)(3) Ci+{X+ I)(4, 

correct to fourth differences. 

19. Show that the sum of the terms of the series «_2, «_i, W], 

can be expressed in the following form 

4- BBhiQ + CB \, 

where S-Wq and denote the second and fourth central differences of 
Uq \ and find Ay B and C. 

20. By splitting up the fraction of the form 

__ 

(jc2-i.2) 

into partial fractions, show how to arrive at u^. in terms of known values 
of the function of which x occupies the central position. 

21. If Wa. = ^Wo-f SAwqH-CA 2i/_j4-..., i.e. a general expression for 
Wjr in terms of central differences, prove by expressing all differences in 
terms of advancing differences of Uq that 

= Uq 4- ^(1) Awo 4- x^2) A2w„i -f (;x +1 )(3) A^w. j +..., 

obtaining the general term in the expansion. 

22. Show that in the general divided difference interpolation formula 
any two successive terms can be reduced to a pair in Everett form. 
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INVERSE INTERPOLATION 

1. When performing direct interpolation, values of v corresponding 
to various values of the argument x are given and we are required 
to find a value of the entry y corresponding to a value of x inter¬ 
mediate between the given values. If it is required to obtain an 
interpolated value of the argument corresponding to an intermediate 
value of the entry, the process adopted is called “inverse inter¬ 
polation”. In other words, for direct interpolation we assume a 
curve y — passing through the points {x, y) and estimate the 
value of 3; corresponding to some intermediate value x': for inverse 
interpolation we have a similar curve but are required to find a 
value of x corresponding to a value y\ 

For certain functions we may obtain the result easily. If 
y = sin X, then .r = sin~^ jy; if y = x’^, then x —y ^; if y ~ then 
X = log yjlog a. The required values of x can be calculated im¬ 
mediately in these examples. 

On the other hand, if the data are simply corresponding numerical 
values of x and y, all that we can write down is a formula such as 
Newton's or Stirling's: we must then endeavour to obtain a value 
for x by solving an equation. For example 

J = ( 1 + A)-" 7^0 = W() + 5C(1) Amo +-^( 2 ) AX + ^( 3 ) + • • ■ • 

If second differences may be assumed constant we have a 
quadratic equation which can be solved at once. Should this 
assumption be inadmissible, then wc are faced with an equation 
of higher degree than the second and the solution of such an 
equation may be very laborious. In these circumstances we resort 
to approximate methods of solution of the equation. 

2. Consider the problem of reversion of series. If 

y=:hx + cx^ + dx^ -f ... 

and we wish to obtain an approximate value for x in terms of y, 


FM AS 


6 
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we may write x == By + Cy^ + Dy^ +... -f Ky'^ -f .... The coefficients 
By Cy D ... K ... may then be found by equating the coefficients 
of the various powers of x in the identity 

x—B {bx-\-cx--\- C {hx^cx^+ 

If y is numerically less than unity and we use only n terms in 
the expansion for a; in terms of y —i.e. we let 

x=^By+ Cy- +... + Ky'^ 

—we are neglecting only and higher powers. Since y is less 
than unity, the neglected terms will usually be small. 

If however y = = + ... -f A:(„)A^^z/y+ ... and 

we wish to find x from this equation, we cannot with equal safety 
neglect the terms ..., for these all contain .r, x^ .... 

It is thus seen that the problem of inverse interpolation, although 
analogous to that of reversion of scries, involves considerable 
difficulties. The best method of approach is from a practical point 
of view. 

Given a problem in direct interpolation, the results obtained by 
the use of an interpolation formula are jUvStified only on certain 
assumptions. Similarly, in interpolation by differences for an 
inverse function the results must be judged practically by the 
progression of the differences. It may be stated however that if 
interpolation to nth. differences is accurate enough for f {x) it does 
not follow that the same number of differences will suffice or will 
be required for the inverse function. 

3 . The problem of inverse interpolation may be viewed in two 
ways. We may, by graphic or other indication, observe that the 
value of X which we require corresponding to some given value of 

lies in a certain narrow interval. Thus, if we are given the 
following table : 

Rate of interest 

per cent. 23 3 3 ^ 4 43 5 

Annuity-value 87521 8-5302 8-3166 8-1109 7‘9i27 7-7217 

and we are asked to find the rate of interest for which the annuity- 
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value is 8*000, we may take the interval 4-4^ as the interval (o, i) 
or (— 4-1^) and write down an interpolation formula, using only 

a quadratic or cubic function. As the interval and the values of 
the variable are small and the differences are rapidly decreasing, 
the solution of such an equation will, in general, give sufficiently 
satisfactory results. If the equation is a quadratic the solution will 
present no difficulty; for a cubic various methods are available, 
some of which are discussed later. 

4. Alternatively, we may exchange the dependent and independent 
variables, 'fihat is, given a table of y — ii^ we may use the inverse 
function x~Vy, 

This method of interchanged variables is subject to very severe 
restrictions on the function over the range of values used. In 
the first place, must be strictly monotonic—i.e. uniformly in¬ 
creasing or decreasing—over the range of values given and the 
unknown value of v, the value of v in all practical cases lying not 
outside, but within the given range and near the middle of it. If 
Uj. is not strictly monotonic, the inverse function becomes two¬ 
valued at least—possibly many-valued—and hence cannot be 
represented by a polynomial. In these circumstances the ordinary 
methods of finite difference interpolation are unsafe. 

In consequence, before this method of inverse interpolation can 
be attempted we must have some extraneous knowledge, graphical or 
otherwise, of the nature of the function We must usually, in 
fact, be able to see roughly the position of x. These conditions 
being premised and being generally satisfied, the necessity for a 
sufficient number of values of Uj. and a small enough interval is 
naturally seen. 

5. The point here made is clearly brought out by a consideration 
of the following example. 

Example i« 

Obtain a value for x when — given the following values: 

X o 1 2 

Maj O I 20 

6-2 
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There would seem to be two possible methods at our disposal: 

(i) We might write down at once (if we think it safe to assume that 
second differences are constant) 

j = (I -f A)* Wo = ^0 + ^'^(1) + ^(2) ; 

i.e. 19 = a: + 9 —a:), 

so that gx^ -- 8:v — 19 = o, 

from which ^c=: 1*964... or —1*075.... 

(ii) Since we have to find an interpolated value of x corresponding 
to a value of y we might treat y as the argument and x as the entry. 
Let us write the data in the form 

y o I 20 

x = Vy o I 2 

and apply the Lagrange formula to calculate Vj^g as if for direct inter¬ 
polation. 

We shall have 

_7^9. ^ _ I _ .2_ 

i9.i8.( —i) 18.1.( — 19) ( —i).2o.i9 

or =2*8. 

It will be seen therefore that we have obtained two distinct sets 
of results. By adopting the first method x has the values 1*964... 
or — 1*075... and by adopting the second method x has the unique 
value 2*8. It remains to examine the reasons for the difference and 
to ascertain which result, if any, is more likely to approximate to 
the true interpolated value or values. 

In the first method it will be apparent that we have taken a 
curve of parabolic form, y = a + bx-\-cx^y and have obtained 
values of x corresponding toy =19. This gives two values of x, 
one on each side of the vertex of the parabola. In applying the 
Lagrange formula inversely we have assumed that x is a quad¬ 
ratic function of y and have given y a particular value (19) in the 
equation x — x + fy + yy^. If we substitute the value of y corre- 
spending to each value of x from the data, it is easily seen that 
a = o, ^3 = 398/380 and y=-18/380. The Lagrange equation is 
therefore igox == iggy - gy^. 



INVERSE INTERPOLATION 


85 


Now if the two curves 

y = — 8 ^ 


and 


199 

x = -^~ y- 
190-^ 


190*^ 


be plotted on the same graph, it will be seen that they take different 
shapes, thus: 



Fig. I. 

On the curve y = gx^ — 8 x the abscivssae of the points whose 
ordinates are 19 are 1*96... and —1-07..., whereas on the other 
curve there is only one point for which the ordinate is 19, namely 
the point (2-8, 19). Unless therefore the two curves obtained from 
the data, (i) by treating x as the argument and (ii) by treating y as 
the argument, intersect at the required interpolated value, as for 
example at K in the above figure, the two methods are bound to 
give different results. 


6. Although there would seem to be three different and possible 
answers to the question above, we must be very careful before we 
draw any conclusions from the results. 

In the first place, we are not told, and have no right to assume, 
that Ahij, is constant, and that consequently second difference 
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interpolation for y is sufficient. On the data as given the only 
conclusion Avould be that they are inadequate either for direct or 
inverse interpolation. 

Secondly, even if we may assume that A^Uj. is constant, we see 
that 

(a) a portion of the curve over the given range, namely from 
;x: = o to I, is non-monotonic and one of the essential conditions 
set out in paragraph 4 is thereby infringed; 

(ft) notwithstanding the assumption that second differences may 
be used for direct interpolation, we arc not justified in assuming 
that this order of differences is sufficient for inverse interpolation; 
we cannot therefore safely use Lagrange’s formula. 

If we make the assumption that second differences are constant, 
we cannot properly use the point (o, o) for inverse interpolation, 
as the given values do not indicate the form of the curve between 
x = o and x=i. By omitting the value (o, o) and using the values 
for jy when i, 2, 3 and 4, a more satisfactory interpolated value 
can be found. The student should attempt this by one of the 
methods described below. 

7. The formulae of direct interpolation are based on the properties 
of rational integral functions of the variable, and any formula which 
proceeds to nth differences gives exact results when applied to a 
rational integral function of the wth degree. By stopping short of 
nth differences the formula can, of course, be used to obtain 
approximate results, and the success of the interpolation depends 
on the magnitude of the terms omitted. Thus, if we use rth 
differences for a polynomial of the nth degree in x, the result is the 
exact value of terms up to and including the term in x^. The terms 
beyond are disregarded, and this can only be done legitimately 
if they are relatively unimportant. 

In questions of direct interpolation there is only one value of y, 
i.e. of Mjp, for a given value of x. There may be, however, more than 
one value of ^ for a given value of y. In fact, if jy is a polynomial 
in X, X is a polynomial in y only when both functions are of the 
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first degree. In other cases the inverse function may be an infinite 
series or an irrational function. For example, in the Table of 
Mortality there is only one value of for a given value of 4 
(where 4 represents the number of persons attaining exact age x in 
any year of time, and is the number of these who die before 
reaching age 1). In the neighbourhood of the peak of the death 
curve, however, there will be two values of 4 within a short range 
of interpolation for a given value in d^.. 

For these reasons the subject of inverse interpolation is more 
troublesome than that of direct interpolation, although it should 
always be remembered that the conditions attaching to the differ¬ 
ences of for direct interpolation are the same as those attaching 
to the differences of Vy for inverse interpolation. One principal 
condition is that within the range of interpolation there should be 
only one value of x corresponding to a given value of y. 

Let us consider the equation in Example i, namely 
j = — 8.V 4 * 9^*^, 

where the range of interpolation is from o to 2. The first point to 
note is that the function is not a good subject for direct interpola¬ 
tion except when the formula is applied to its fullest extent—the 
second degree. The reason is that the last term is the predominating 
term throughout the greater part of the range. 

In most instances, by altering the interval and reducing the 
range of interpolation, a function can be reduced to a good form 
for direct interpolation. Such a question as the following might 
be put: 

Given the function y = u^== + for what intervals of 

X should be tabulated so that in any interval an interpolated 
value of y can be obtained by first difference interpolation with an 
error less than, say, -ooi.? 

Put x = zla; 


Sz QZ 

= - + ^ 


i.e. 


—8az-\-gz^y 
—8^2+ 182' +9, 
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18, 

Suppose Vg to be tabulated for values of 2r at unit intervals. 
Then, for an interpolated value between and 

^z+( = + I ^ - I) 

If we take v^ + tAv^ as the interpolated value, there is an error 
i) and the maximum numerical value of U{t—i) is J, 
being the value when ^ is J. 

Therefore, by the conditions, 



It (t— i) A“2;^< -ooi. 


-001. 

But iSja-y 

/. 18/8^^ < *001; 

i.e. 

a'> 18000/8, 

i.e. 

> 2250, 

or 

a >^,^2250, 


and the most convenient value for a is 50. 

We must therefore tabulate at intervals of 5^^ of unity, i.e. at 
intervals of *02. 


For example. 


j —w^= — 8.V-I- 


A' 

z 


1*10 

55 

2*0900 

1*12 

56 

2*3296 

1*14 

57 

2-5764 

i*i6 

58 

2-8304 

i-i8 

59 

3-0916 

l'20 

60 

3-3600 


•2396 

•2468 

•2540 

•2612 

•2684 


Both for direct and inverse interpolation this table is better 
than one proceeding by larger intervals. It must not be assumed. 
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however, that a table which is good for direct interpolation is 
necessarily good for inverse interpolation: in particular, inverse 
interpolation always presents difficulties if Aw^/Ax is small. 

8. Practical methods of inverse interpolation. 

It is evident that the problem of inverse interpolation is the 
same as that of direct interpolation for unequal intervals. The 
methods of Lagrange or of divided differences can therefore be 
employed to obtain any intermediate value of x corresponding to 
a value of yy given a table of y~Uj.y by the use of the inverse re¬ 
lation x — Vy, Alternatively, we may treat the problem as one 
involving the approximate solution of an algebraic equation. Two 
of the methods often adopted in practice are given below. They are 
simple to apply and generally lead to satisfactory results. There 
are, however, certain objections to the methods—for example, it 
is difficult to ascertain tlie degree of accuracy which has been 
reached in the approximate answers—and for many purposes the 
most convenient and practical plans are probably Aitken’s methods 
of cross-means, to which reference is made in Chapter VIII. 

9. Successive approximation. 

In the first place we obtain either by inspection or by a rough 
graph two values of x lying on either side of the required inter¬ 
polated value. (For example, a value for x when y is zero in 
the function t = x“ —4x4-2, i.e. a solution of the equation 
x“ —4x4-2 = 0, lies between the values x=^o and x=i.) We then 
choose a suitable origin and unit of differencing so that if x be 
the interpolated value and lies between two successive values of 
the argument, the interval will be small and x will be as near to 
the origin as possible. 

Suppose that the required value lies between o and i. 

The method proceeds as follows: 
u^ — Uq-{-xl^u^-\-\x (x— i) A2wo4-Jx (x~ i) (x — 2) A*'^Mq4* .... 

Since x is small, a first approximation (aj) will be obtained by 
neglecting terms involving second and higher differences of Mq. 
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.+ aj Amq approximately, 
i.e. ai = (M^-«o)/A«o, first approximation. 

Again, neglecting third and higher differences, we may write 
Wr = Wq + ^2 Awq + (ai — I) A^Mq, 

where a2 is a second approximation and is therefore not very 
different from a^. This gives 


U^ — tL 


Similarly 


ao= 7 , I second approximation. 

A 7 /o+ Hai-i) AX 


a.o = -r 




A?^ + i (^2 - I) A^Wo + i (ag ~ I) (a. - 2) A'^z/q ’ 
and so on. approximation, 


10. Elimination of third differences. 

We have, as far as third differences, by expressing Uj, in terms of 

Wo, Awo, ..., 

w^ = Wq + xAwo + Tx (x — I) A“Wo 4 - -J-v (x - i) (x — 2) A^Wq. 

Also, in terms of w^, Awj, ..., 
u^-u^-\-{x- 1) Awi + i (x— i) (x —2) AX 

-f ^ (x-i) (x-2) (x-3) AX. 

If now we ignore the terms containing third differences and 
multiply both sides of the first equation by 3 — a and both sides 
of the second equation by oc (where a is an approximation to the 
required value, found by inspection or otherwise) and add, a new 
quadratic equation in x will be formed. The error involved in 
ignoring the third differences will be small, since 

(x~ i) {x-2) (3 --a) AX + 'e'^c (x- i) (x-2) (x-3) AX 
will be small provided that A%o and A‘'^Wi are not very different. 

11. The following question is solved by both these methods: 

Example 2. 

Find the value of x for which y is 18,600, given 

x 52 53 54 55 56 

y 19,231 18,868 18,519 18,182 17,85s 
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Changing the origin, the difference table is 


X y 

Ay 

A^y 

A^y 

0 i 9 > 23 i 

“363 



I 18,868 


14 


2 18,519 

-349 

12 

-2 


-337 


— 2 

3 18.1S2 

-327 

10 


4 17.855 




By the ordinary advancing difference formula 


18,600 = 19,231 — 363JI0 + 

14V (x—i) 

zx (x 


2 ! 


3 * 


where the value of x is required corresponding to the value 18,600 of y. 
(i) Successive approximation. 

Since x is small, a first approximation will be 
19,231 — 18,600 


3(>3 

Including the next term, 


or 17383.... 


.._.63I_ 

363-7 (at-l) 


or 17634-. 


Similarly, 


631 


’363-7 (“2-1) +J (“2-0(02-2) 


or I-7646.... 


The required result is therefore 52 + 1*7646... = 53*7646..., where, 
it should be noted, the last digit is uncertain. 

(ii) Elimination of third differences. 

We have 

18.600 = Ma. = (H-A)® Wo= 19,231 -363:v + 7:r (:x:- i)i) (x-z) 
and 

18.600 = «a; = (i = 18,868-349 (^-1)4-6 (x~i) (x-z) 

-i(*-l)(*-2) (a:-3). 
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By inspection a rough value of the interpolated value is 1*75, allowing 
for the change of origin. 

If, therefore, we multiply the two equations by (3 —1*75) and 1-75 
respectively, and add, we may neglect the fourth term. The factors 
being 1*25 and 1-75 we can use 5 and 7: we thus obtain 

12 X 18,600 = 5 [19,231 — 363 a ; + 7 a : (x— i )] 

+ 7 [18,868 -349 (.V -1) + 6 (a- - i) (JC - 2)] 

or 223,200 = 230,758 ~ 4419:^ + 77x2, 

i.e. 77x2 _ ^ ^ o 

Solving the quadratic, the value required is x= 1*7646..., which 
agrees with the value of in method (i) to four decimal places. 


12 . In Example 2 the advancing difference formula has been used; 
in practice, however, it is more usual for many reasons to employ 
a central difference formula as the basic equation. Central dilTer- 
ence formulae are more convenient because the coefficients are 
smaller and converge more quickly; this in itself is a decided 
advantage. Having placed x in the middle interval the first approxi¬ 
mation is generally much more accurate (thus, in Example 2, the 
use of Stirling's formula gives 

a,= I 4-268/356 

= i753)- 

Also, X is smaller, and the maximum number of significant digits 
which the data will yield may mean an additional place. 

Consider the following example. 

Example 3. 

Find the root of the equation —qx—14 = 0 wdiich lies between 
3 and 4. 

Let jv = x^-9x—14. Then we have, by actual calculation, 

X 3*0 3*2 3*4 3*6 3*8 4*0 

y —14*000 —10*032 —5*296 *256 6*672 14*000 
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The difference table is 


X 

y 

Ay 


A^y 

3-0 

-14*000 

3-968 



3-2 

-10*032 

4736 

00 

6 

00 

3-4 

- 5*296 

5-552 

•816 

0 

00 

3-6 

•256 

6*416 

•864 

•048 

3-8 

6*672 

7-328 

•912 


4-0 

14*000 





Taking the origin at 3*6 and using Stirling’s formula: 

Awo4- Aw_j AO X (x- — I) + A^w_o 

+ jc .-A + .. 4* -7 - -. - .— “, 

2 2 6 2 

the interval of differencing being 0-2; 
i.e. o = *256 4 - 5*984^:4--432^:24-*008^ 

The cubic equation can be solved by successive approximation, or 
we can repeat Stirling’s formula for the next value of Wg. and adopt the 
alternative method outlined above. 

If the first of these methods be adopted, it will be found that 
successive approximations to the value of x are —*04278, —*042913, 
— *042971. From the last we obtain as the required solution 

3-6 - (-042971 X-2) or 3-5914058, 

which is correct to six decimal places, the seventh being nearer to 7. 

If we choose our origin at the point (x, r) and the value of the inter¬ 
polated value is .x4-a, then, when a lies between — J and |, it is advan¬ 
tageous to use Stirling’s formula. If a lies between J and f Bessel’s 
formula should be applied. (Whittaker and Robinson, Interpolatioriy 
p. 60.) 

13 . In general, if Aw^ has n significant digits we cannot rely on 
more than n significant digits in x. Even then the last digit is 
doubtful and may in fact be misleading. In Example 3 above we 
have been able to obtain the fifth significant digit in *042971 from 
a four-figure value of A3; only because y and A3; happen quite 
unusually to be exact and not rounded off. 
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14 . The method of successive approximation is a convenient 
method for ordinary use. If we want a result that can be obtained 
to the required degree of accuracy by taking out differences as 
far as A%, or if the curve is a cubic, the elimination method will 
give a satisfactory answer. The disadvantage of this process is that 
we cannot approach our interpolated value by steps as is done in 
the method of successive approximation. Moreover when fourth 
and higher differences are not negligible the elimination method 
breaks down. 


15 . The general investigation of the accuracy of finite difference 
methods of approximation is a problem in direct interpolation, 
and has been dealt with previously. In dealing with the subject 
of successive approximation, however, it is of interest to include 
in the present chapter an elementary illustration of the fact that 
in certain circumstances a better interpolation can be obtained by 
neglecting higher differences than by retaining them. 

For example, if we have a third difference curve, then 

iix = Uq + )Awy + X(2)^hjQ + ^(3)exactly. 


The error (a) in taking two terms is .r(2)A‘‘^/^Q4 -a:(3)A"'Wq, 
and (^) in taking three terms is ;x:(3)A^Wq. 

(a) may be expressed as 


.•v:(x-l)(.y- 2) r J_ AX ■ 
6 _^x — 2 


A^z^q, 


and ( 13 ) as 


a: (^c — i) {x — 2) 
6 




Then, ignoring the sign of which will 

be the same for both (a) and (/S), (a) will be less than (^) if 

3 

—2 A%o 

is negative and numerically less than z. 
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In these circumstances the error made by retaining first diflPer- 
ences only is less than that made in continuing to second differences. 
As an illustration consider the function 50. 

It is easily seen that = = 6. If 


X is for example, 
than 2. 


3 A‘"z/ 

“—^ ~3/^4> which is negative and less 
X 2 ^ Wq 


(a) is therefore less than (/?). 

This can be otherwise shown by finding the values of the errors. 
U} - S'lit exactly. 

Also = (I + A)^ i/q 

= 2'o + J,A?/o -3^2A^W„+..., 

Mo+]A;/u=51.\ = 5i;:j, 

«o + = 50} il 


. («) 

and = . (^) 

(a) is less titan (/ 3 ), so that the approximation to first differences 
is better than that to second differences. 


16. In conclusion it should be emphasized that the accuracy of 
the result depends greatly on the fineness of the interval of 
tabulation. In cases of doubt it may be desirable to iialve the 
interval before applying the process of inverse interpolation. This 
will reduce first differences by about one-half, second differences 
in the ratio of about i: 4, third differences in the ratio of about 
1:8, and so on. The comparative effect of higher differences is 
therefore much reduced. 


EXAMPLES 5 

1. Given m, = 0, Wo = 1 12, Wg = 287, Wg ~ 612, find Using 
and M4, find a value for x when = 270. 

2. The following values of w^. are given: ^10 = 544, 1/35=1227, 
W20 = 1775* Find, correct to one decimal place, the value of x for which 
Ucr.=^ 1000. 
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3. Having given log 1=0, Iog2 = -3oio3, log 3 = *477i2 and 
log 4 = *60206, find the number whose logarithm is *30500: 

(i) by expressing log x in terms of log i and its differences and 
solving for x; 

(ii) by using Lagrange’s formula applied inversely. 

Explain the nature of the assumptions in each case. 

4. Apply Lagrange’s formula inversely to find to one decimal place 
the age for which the annuity-value is 13*6, given the following table: 

Age. X 30 35 40 45 50 

Annuity-value at 4! per cent. 15*9 14*9 14*1 13*3 12*5 

5. /(o) = i6-3S,/( 5 ) = I4-88,/(io) = i3-59 ,/(i5)=:i 2-46. Find x 
when / (x) = 14*00. 

6. Given the following table of / (v): 

/(o) = 2i7, /(i) = i40, /(2) = 23, /(3)=-6, 
find approximately the value of x for wLich the function is zero. 


7. The following values of / (;i^) are given: 

/(io) = i 754 , /(i5) = 2648, /(2o) = 3564. 

Find, correct to one decimal place, the value of x for which / (a:) = 3000. 


8. Given four values of a function ^3, show how to calculate 

an approximate value for x from the equation 


u^ = UQ-hxAt/Q-h 


X (x~ i) 


21 




A’ (x — l) 2) 


3J 




by obtaining a quadratic equation in place of a cubic. 

Use the method to find x when — given ^1.0=1 *0000, 
^1*1 = 1*0323, 1*0627, 2/1.3= 1*0914. 


9. Given that (i*20)*'*= 1*728, (1*21)^-1*772, (1*22)'^ = i*8i6, 
(1*23)''= 1*861, (1*24)''^= 1*907, explain carefully how to find the real 
root of the equation + a — 3 = 0 by a method of inverse interpolation. 
What method would you adopt in practice? Obtain a value for the 
root to four decimal places. 


10. The following table is available: 

Age X . 44 45 46 47 

//a; at 4I per cent. 13*40 13*16 12*93 12*68 

Find, to two decimal places, the age corresponding to an annuity of 
13-00. 
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11. Find, to two decimal places, the real root of the equation 

4 - Jc — 5 = o 

by means of divided differences applied inversely, using the values of 
the expression when x-o, i, 2 and 3. 

What is the reason for the poor result obtained in this case? 

(The true solution is 1*516 approximately.) 

12. The equation x^ — 6 x~~ii—o has a root between 3 and 4. 
Obtain it by inverse interpolation correct to three places of decimals. 

13. The formula for the value of an annuity-certain for n years at 
rate per cent, i is given by 

I — ? 

, where ?;? — (i 4-f)"^. 

Given the following table, obtain to three decimal places the rate per 
cent, for which <2001 ^4* 

Rate per cent. 3 3.^ 4 4J 

«20l. 14-^775 14-2124 13-5903 13-0079 

14. Solve the equation 10 logj^ .v, given the following data: 

Argument x; 1-35 1*36 1*37 1-38 

log'* ••• -1303 -1335 -I3l^’7 -1399 

15. Apply Lagrange’s formula (inversely) to find a root of the 

equation iiy. — o^ when 1/3^, - —30, — —3> ^43— 18. 


F M A S 


7 




CHAPTER VI 


SUMMATION 

1 . Certain series whose law is given or of which there are sufficient 
terms to enable the law to be assumed may be summed by the 
methods of finite differences. 

By definition we have 

f{a + h)-f{a) = Af (a) = <l>(a), say, 
f{a + 2h) ~f {a + h) — Af {a + h) = (f‘ (^ + ^)> 


f{a f nh) —/{a + n— ih) = A/(fl + w~ i/z) = (/> (a + w — i^); 

•*. f{a-\-nh)-f{a)-<l) {a)-h(f> {a +h)-h ...{a + fi— ih). 

If therefore f{x) is the function whose first difference is <j> (a:) 
we can find the sum of any number of terms of the series whose 
general term is <j> (x) in terms of values of /(^), for any given 
interval of differencing. 

By a suitable change of origin and scale we can make the interval 
of differencing unity and the first term of the algebraic series under 
consideration ^ (i). On putting a and h each =i, the required 
relation then becomes 

1 

The expression/(«-h i) —/(i) is sometimes written in the form 

[/.>r. 

This represents the process of substituting n+i and i for x 
successively in f{x) and deducting the second result from the first. 

Note. It 
and not to 


should be remembered that S <f} (x) leads to |/ 

/w 


6+1 
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2 , Although any function of x can be differenced, there is only a 
limited number of functions which are the first differences of other 
explicit functions. The principal forms of such functions are given 
below. 

(i) It can be easily seen that, since Aa^ = (iz— i) 



• -ig the function whose difference is 

a—i 

n 

We can therefore find 11 Uj. by the method above if is of the 
1 

form k(f\ 

(ii) The relations and enable the 

sum of any series whose «th term can be expressed in the factorial 
notation to be summed immediately. 


Example i. 

Sum to n terms the series whose .vth term is x (x — i) (x-2). 
Now i) 

and since Aa:^^> — 


n 

\ 



n+1 

1 




= l {n + i) n {n-i) (w-2), 


since the product of four successive terms of which i is the first includes 
o and is therefore obviously zero. 

If the interval of differencing does not happen to be unity the identity 
may be applied, but care must be taken in doing so. 
Here is the difference of the function so that in summing 

a series whose ^th term is, for example, zx (zx — z) (2:^ — 4), we must 
divide zx {zx — 2) (2.V — 4) {zx — 6) by /^ — 2 as well as by m — 4 before 
taking the limits w 4-1 and i. 

(iii) Any polynomial (x) of the wth degree can be expressed 
by an interpolation formula ending with A’^w^ in which each coeffi¬ 
cient is of the form X(^^) or J i)u)]- 


7*2 
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Since ^(r)==(^+ l)(r-fl)-^(r+l) 

= 

it follows that S.V(^) = :V(^.,i). 

Each term of the formula can therefore be summed at sight. 
Thus, the advancing difference formula 

Mj, + X(i) + X(2) A'^Mq + X(3) A'^Mj, + ... 

yields + x,,) Ai/„ + x^, A^ + X(4, A®m„ +.... 

The Gauss forward formula in its simple form is 

«x = «0 + •^’(1 + I )( 3 )^“» -1 + • • • 

and therefore 

= X(1) Mo + ^(2) AMo + X(3) A^M . 4 + (x + I) (1) A=m„i + .... 
Stirling^’s formula may be written in Sheppard’s notation as 

Mx = «o + ^(i)/^H+ 2 [(‘’J+ i)(2) + ^(2)] S‘«o-+ (•^■+ i)(3)p5X+ ; 

whence 

!::m^ = X(i)Mo + X( 2 )P§Mo+ A [(x+ i)( 3 ) + X( 3 ,] 8X + (^+ i)(4) •••• 

Example 2. 

Find the sum of n terms of the series o, lo, 33, 77, 150, given 
that third differences are constant. 

By taking out the differences it is seen that Aw = 10, A‘‘^w=:i3 and 
A3w = 8. 

The advancing difference formula gives 

M^ = 0+ IOX(i,+ 13X(2)+ 8X(3). 

n—1 

If the series is iiq, il^ ... ... then we require S 

0 

ri —1 n—1 

A S [ioX(,)+i3X(2) + 8x(3)] 

0 0 

= |^IOX,2)+ I 3 X( 3 )+ 8 X( 4 )J 
= I0«(2)+I3«(3) + 8«(4) 

= \n {n—i) (2«‘^+ 3^+16) on reduction. 
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Alternatively, we may continue the table backwards to obtain 
the differences required for the central difference formulae. 
Stirling's formula then gives 

Sm^== 7 - 5 X( 2 ) + 5 i)(3)4-a:(3)] + 8 i)(4) 

n —1 

and 2 <,. = 7-5«fa) + 2-5 [(« +i)u) + W(;i)]- 8 («+i)( 4 ) 

0 

— ^n (n — i) ( 2 ; 7 “ + 3 w-h i 6 ) as before. 

Should the series be given in the form 2 /^, ii^r ••• ^hen 
- I)( 1 )+ - I)(3) 

. •. = ( a; - I) (1) WjL + (A’ - I) (o) An I + (x - I) (3) -f (A - I) 4) A^Wi 

and 

n r nn+i 

^ “ I) (1) - I) (2) - I) (3) + (^ ~ I) (4 > 

1 L , Ji 

= w ( 1 ) Wi + n ( 2 ) A/Zj -I- n ( 3 ) A‘\ + «( 4 ) A^?q, 

which, on substituting for Az/^, A-z/j and A^?/,, gives the result 
obtained above. 


3 . Since Aa^ the series whose Ath term is of the 

form [(a+ i) (a + 2) ... (a + Zj)]-^ can be summed immediately. 


Example 3. 

Sum to n terms the series whose Ath term is ,-\ ^ “r/- 

(a+i)(a4-2)(a + 3) 


(a + i) (a+ 2 ) 


(A-hl)(A + 2) (a + 3)’ 



I 

(a+i) (a + 2) (A + 3j 


1 

2 


(ah- l) (a + 2)J 


1 

1 


1 I 

2 (w + 2 )(w+3 ") 



4. It is worthy of remark that the rules for the summation of series 
of the types given in paragraphs 2 (ii) and 3, as given in the text¬ 
books on algebra, are precisely the same as those stated above, and 
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are based on the same principle. For example, for a series whose 
nth term is 


{a-\-nb) {a-\-n+ib) {a + n-{-2b) ... (a + n + r~ i6), 

the finite difference method is simply to write this in factorial 
form, with interval of differencing and then to proceed on the 
lines laid down, thus: 


n n 



(tZ 4 - W -f ^ 


I 


/,v '•) r 

’ b\r+i) J, 

(a+ 71 + rb) 

i (r-f i) 


(rfl) 




constant. 


This, of course, produces the same result as is given by the 
algebraic rule: “Write down the /zth term, affix the next factor at 
the end, divide by the number of factors thus increased and by the 
common difference, and add a constant’’ (see Hall and Knight, 
Higher Algebra, p. 314). 

For the series whose wth term is the reciprocal of the one above 
the inverse factorial is used, and a similar result is obtained. 


5 . It sometimes happens that on taking out successive differences 
of a series a stage is reached where a particular set of differences 
forms a geometrical progression. In that event the series can be 
considered as consisting of two separate series, (i) a series whose 
general term is a + 6a: 4- cx^ 4-... 4- kx^~^ (a rational integral function 
of x), and (ii) a geometrical progression. 

Suppose, for example, that second differences are in geometrical 
progression with common ratio r. Then 
w^ = a4-6a:4-z:r^. 

For At/a. = 64 -c (r—i) r^, 

and = c (r — i = At®, 

where k = c (r—i)\ 

It follows that for nth differences 

^^u^=c{r-lYr^. 
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Example 4. 

Sum to n terms the series i, 6, 11, 18, 31, 58, 115, .... 
The difference table is 


1 
6 
11 
18 

31 

58 

115 


Third differences are in G.P. with common 
ratio 2. 

A^/^a. Assume therefore that 


5 

5 


o 




6 


^3 

27 

57 


H 


30 


u,^ ~ ahx^ ^' -f - ^ + dz^y 

A//, zcx'''^'^ + dz^ (z — 1) 

2 ~ b-h zrx^^^-i-dz^-y 

= zc + a z^y 

4 / S } h {^ r ^ dz ^, 

Inserting the differences for the value 
x=iy ha\e 

16 2 = A^f/j=2£^; o = A“Wj =2c + 2(y; 

5 = A?/j = Z? 4- 2C + 2^/. 

From these equations we find easily that 
d~i\ c = —i\ /? = 5, 


Putting a’-- I in -\-cxy-^ +dz^y we have 

I ~ u^ = a-\‘b + zdy 
whence a^i—b — zd— — 6 ; 


- 6 4* 4- 2^; 




- 6 a;( 1 ) 4 -* 


-v(2) v('^) 


-_j_ 

3 2-1 


-6 (w4-1)^^^ 4-64- 




3 


•2, 


since all the factorials except the first vanish for the lower limit. 
This simplifies to 

fi 

2”"^^ — 2 - ^ (2W-— 15/2 4-19). 


Alternatively, we may proceed thus: 

A^w^ = dz^. 

Deduct dz^ from and difference the function tt^ — dz^' in the 
usual way. 

By giving x the values i, 2, 3, 4 in succession it is easily seen that 
u^ — dz^ takes the values — i, 2, 3, 2 respectively. On forming a differ- 
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ence table we find that the leading differences are 


Then 

30 that 


A {u^ - dz^) = 3, A2 (m^ ~ dz^) = ~ 2. 

K^ = 2*-i+3 (:v-i),i)-2(a:-i)|.„ 


1 


2^ - (x - l),i) + 3 (x - l),2) - 2 (x- l), 


(3) 


n+1 

1 


= 2"'-^ - 2 - «(1, + 3n,2, - 2«(3), 

which, on simplification, gives the same result as that above. 


Note. In the methods above we have called the terms Wo* ^3 ••• 
and have used the property = We may shorten the work 

slightly by beginning with x — o (so that all the factorials vanish at the 
lower limit) and using instead of x^^K Further, the two methods 
give the same result without transformation. It will be instructive for 
the student to rework the problem on these lines. 


6. The form Summation by parts. 

When the general term of the series is the product of two 

n 

functions of x and the value of each of the summations X and 

1 

n 

E is known, a method known as “summation by parts” can be 
1 

adopted. 

We have A ( U, FJ = F,,, - U, V, 

= U,^,AV,+ F,AUy, 

F,At/,=A(f/,F,)-C/,+1 AF.; 

n r “1 ^*^+1 ^ 

^[F,AC/,] = [c/,F,J^ -:S[C/,+iAF,]. 

It follows that when the function can be put in the form 

n 

F^AC4 the summation can be performed at once if E [f4+iAFJ 

1 

can be evaluated (but not otherwise). 

Note. For extensions of the formulae for summation by parts see 
Chapter VII. 



USE OF THE OPERATOR E 


105 


Example 6. 

Find the sum of the series « + + + « terms. The 

terms are successive values of the function y = and since 




we may 


write * for and for in the relation above. 


1 1 


xA 


a-i 


a* 

- X 

a-i ji 


n+l n 
_ V 


,x+L 


a- I 

a-I ‘ 


n+l n 

—s 
1 1 

rj-fl 

1 

-11 ril 


a-1 

“tH 

a —I 
(a- !)■■“ 


- Ax 


, since Ax = I, 


n+l 

1 

,n+2 


a a” _ 

(h+i)^----_- (a-if 


7. The result in paragraph i, namely, that h <j> {x)—f{n+ i) -/ (i). 
where Af{x) = <j> (x), can be obtained by the use of the operator E. 


We have 


since 


XI (x) = <^ + ( 2 ) + ••• + (^0 

^ =(i+E+E- + ... + E'^--^) </>(!) 

E”-i 


■ E-i 


<^(1) 




E-i 

4 (i)^Af{i) = iE-i)f{i). 


S<^(*) = (£"-i)/(i) 

=/(« + i)-/(i). 
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n j^n _ j 

Thus the operator S is equivalent to the operator , and 

1 L — i 

jgr» I n 

we may safely substitute ~ for S in any series of operations. 

n _ j 

Again, since — the identity = -zq can be 

1 h — i 

expressed as 

V I 

L u^- - 

1 


E-i 
1 


K-n-Wj) 


8 , The relation between the operators S and A. 

It has been seen that if 

Af{x)^cf>{x) 

then f{x)^^cl>{x), 

where the summation is performed between certain limits. 

If therefore we omit the limits we may say that with certain 
reservations summation is the inverse process to differencing. 

Consequently <f> (a:) = A/ (x) = AD^ (a;), 

so that AS=i, 

i.e. 

A 

Now although AS=i it does not follow that SA=i, for we 
shall obtain the same result by differencing f{x) + Cy where c is 
a constant, as by differencing / (jc) alone. 

Thus A [f {x)-{-c] = <l> (x); 

f(x)-\-c = A-V (x) = ^<f> (x) = SA/ (x), 

so that SA^i. 

The symbol 2 may be and often is used in place of the inverse 
symbol A“^ provided that it be remembered that 2 (or A~^) and 
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A are not commutative, and that in the indefinite finite integral we 
must include, or imagine, an arbitrary constant which disappears 
in the definite integral. 

Thus, the process of summation in finite differences is similar to 
the corresponding process in the integral calculus and the relations 
between the symbols are analogous to those existing between the 
symbols of differentiation and integration. As a result, finite 
difference summation is often referred to as finite integration. 

n 

SWa; is said to be the indefinite finite integral of u^ \ S u^. the definite 

1 

finite integral; and a function that can be integrated, such as to 
be ‘'immediately integrable’’. 


9, Other uses of the symbol H. 

One of the commonest functions in the theory of life con¬ 
tingencies is the expression obtained by multiplying 4 (the number 
attaining age x) by the interest factor This product is denoted 
by Z)^, and the connection between certain functions dependent 
on Dj. is indicated thus: 




: SN 


Here S denotes summation from age x to the end of the mortality 
table, it being understood that values beyond the end of the table 
are zero. 

In point of fact, the correct way of showing the relation between 
the functions D and N, etc., is 

nA... 

t-0 


where x is fixed and t is the variable. 


In modern mathematical works it is now usual to use the notations 

cjo n 

21 , 2 etc., where the variable t is specified for the lower limit only. 
b 

The still shorter form 2 is often used where there is no doubt about 

a 

the variable. 
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(i) When 2 is used in the special sense 

= + + the end of the table; 

= (^x+l + “x+2 + +•••)“” \ 2 + • • *) 

~ ~~^Xi 

SO that here AS = — i . 

(ii) If, however, S is specially defined so that St/^ indicates 
a summation beginning with and ending with the last term 
preceding then 

S//^ = Wi + «2 + ^^3 + • • • + ^^x- \ y 
= (z/j + Wo + ^^3 + • • • + w^) — (t/j + 1^2 + ^3 + * • • + ^x-l) 

= W^. 

In these circumstances therefore AS = i . 

(iii) Again, in algebraic series, S is often used loosely to indicate 
the sum of the first x terms of a scries, thus: 

Sw3. = w, + Wo +... +Wj.; 

ASw^ = (Wj + Wo + ... + w^^i) - (Wi + W2 + ... + w J 
= w^^x 

whence AS = E. 

These illustrations serve to show that great care must be ex¬ 
ercised in introducing S into any formula. The sense in which it 
is to be used should be clearly defined in every instance: the safest 
course is always to state the limits where possible. 


10. Application of the relation between S and A. 

By treating the operator S as equivalent to A"^, the method of 
separation of symbols can be employed for the solution of problems. 
For example, a convenient formula for the evaluation of Sw^w^ can 
be evolved by which the necessity for the continued application of 
summation by parts can be obviated. 


Example 6. 

Prove that 




wA w^A^ w^A® ] 
{a—iY (w-i)®^‘”j 
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Now = a!^ \^aEY 

and Sa^M^j = omitting the arbitrary constant 

= (£*~i)-i a^u^. 

Let (/> {E)=Aq + A^E+A^E^-\-..,+A^E^ + ,,., 

Then the (r + i)th term in <f> (E) is AyE^a^Uj.. 

I.e. = a^A^a^u^^r 

= a^Ara^E^u^ 

= a^Ar (aEy u^\ 

/. (j> (E) {aE) 

so that if (j> {E) is the operation {E— i)~^ 

{E — 1 )~^ {aE — i)~^ ; 

(aE -1 )~^ 

= a^ (a + aA-i) 


= a® (a — I )“^ I + 


£zA 1-1 


f (i\ rt“A“ 

1 L a -1 (a~ i)- J 

omitting the arbitrary constant. 

Note. An alternative (and simpler) proof of this formula can be 
obtained by the use of the expansion of in ^ scries. See Chap. VII, 

paragraph 12 . 

Example 7. 

Apply the above formula to evaluate 1 ^ 3 "^ (x^ + .v- + a; + i). 

A//^ = A (jc^-(-.v“ + x 4 - i) = 3^^'f 5.V + 3, 

A“W^ = A‘^ + + i) = 6a: + 8, 

and A'^w^^ A'^ (r^ + V‘^ 4 -^+ 1 ) — 6 ; 

23® (^ + a;2 + a:+ i) 


(3-1) ^"*■^(3-1)^^'"^' 


= -- [x^ + + a: + I — f (3^2 + 5X 4- 3) + 1 4- 8) — 6] + £: 


=( 4 X^ - I 4 x‘'^ 4- 28 x - 23 ) 4- c. 
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It is often of advantage to set out the rational integral function 
in factorial or binomial notation; the successive differences can 
then be obtained with little difficulty. 

Thus, by the method given in Chap. I, paragraph 22, 

x^ + x--hx+ I -f + + I, 

so that = 34” 3, 

and, by adopting the formula for we have easily that 

+ .r- + -V + I) = - - ~ 2x^ ^^4-1 - 23) 4- 

o 

To express —23 in the form 

ax^ 4 - bx^ -{-cx + dy 

we use the method of detached coefficients applied inversely: 

2 I 4 -2 18 I -23 

. o_8__ -10 J 

I 4 —10 ; 28 

; ° _ 4 J 

o I 4 ; -14 


Hence 

S3'^ 4- .v- + X 4 - 1) = V (4:^^ — 14^:^4 - 28.V ~ 23) -P Cy 

o 

as before. 

11. The following examples are instructive. 


1 x(:v4-i) 


Example 8. 
Evaluate 
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u =-2^±3_i. = |3 L_1JL 

* x(x~hi)3^ jjc x+i) 

= 3 JL. 

a: 3* x-h I + i 



_ II 

7/ +1 3"^^* 

Example 9 . 

Show that the general term of the recurring series 

UQ-hUjX-hU2X--h.,.+UrX^-h 

for which the scale of relation is i —px — qx\ is Aa^ + Bh'^^ where a, h 
are functions of p, q and Ay B are constants. 

Since i —px — qiA is the scale of relation, 


i.e. 

“n = 0, 

or 

O 

II 

Cl 

1 

1 

Therefore 

{i-aFr^){i-bE-^)u,,^o 

if 

a-{-b=^p and ab= —q. 

This will be true 

if either 

i.e. if 

(i --aE~^) or (i —bE~^) 

«n - ««n-i or u„ - hu„_^ = 1 

Now if 

«n-««n-l=0. 

then 

llyy — aUy^_Yy 


and the series is a geometrical progression with common ratio a. The 
general term of the series is therefore /la”. 

Similarly the general term of the series is Bh^y where A and B 
are constants. But if a new series be formed by the addition of these 
two progressions the relationship will hold good for this new series. In 
other words the most general solution is 

—Aq^B b^ y 

where we may give A and B any values, but a and h must satisfy the 

equations , ^ j l 

^ a^b^p and ab=^ —q. 
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Example 10. 

If fourth and higher differences are ignored, prove that the sum of n 
successive terms of a function, of which Uq is the central term, is 

nUo + ^2s(n^-n) 

where n is odd. 

Since Uq is the central term, it will be convenient to use a central 
difference formula. 

Gauss’s forward formula gives 


iij. ~u^^ + 4- lx — i) + A^i/. j 

= ?/() + +1 (x + IA^w_i; 

= C + xu^^ 4- Ix^^^ Az^ 4- A^z/^j 4- '/f (jc 4-1A^z/.^. 


On summation between the limits (w — i) and I (n — i) the co¬ 
efficients of Az/q and A^z/_;i will cancel, and we shall have 


= 

~i(n-l) 


C 4- xUq 4- lx^“^ Az/() 4- Ix^^^ A^zz_.i 4- 2^ 


(.r4-i)‘^^ A*’'z/ 


i(n4-l) 


^4-_i _ / ” - I \ 

2 ^ 2 ^ 


^0 


I 



/z 4 -1 n — 

1 

^ w — i^z4-i«4-3\ 

2 ’ 2 

2 ' 

^ 2 ' 2 ‘ 2 /_ 


A2ZZ..1 


- nu^^ 4- oV “ ^0 A“Z/_.p 


12. The use of symbols of operation in the summation of alge¬ 
braic series. 

Many forms of algebraic series which at first sight do not seem 
to lend themselves to summation by the method of separation of 
symbols can in fact be summed very simply by that method. 

For example, if be the coefficient of in the expansion of 
(i 4 -a;)^', where « is a positive integer, the sum of the series whose 
rth term is ( —t>e WTitten down almost 
at sight. The series is 

CQa — c-^ (zz — i) 4-C2 (z3! — 2 ) “ ... 
i.e. --CiE~^a + C 2 E~^a — ... =(i —E~'^Y ^ 

= A^ {a — n) 

= 0 , for all values of a. 
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Again, consider the well-known series 
^ _ _£ l _ ^ ^2 _ 

X x-i-l X-\-2 
This may be written as 

(cq — CoE ^~ • • •) 

= (i ~E)^ x~^ 

X (x + 1 ) {x 2 ) ... (:v4-n)* 

The following examples are illustrative of the method. 

Example 11. 

Evaluate 

I n n(n — i ) 

x-hi (a:+i)(jc 4 * 2 ) (jc+i) (jc + 2) (jc + 3) (a:+ i) ... (^ + w +1)* 

This series is ’^n (n— i) ...+( — 1 .)^ «! 

=+ ”, Ax< -» + - + ... + ”; A";«:<-i) 

I! 2! n \ 

= (i + A)” 

= (JC+ «)<-'!) 

_ I 
'~(a: + «+i)* 


Example 12. 

Find the value of 

a"+i -n{a + 6 )’«+i + (a + 26 )"+! 


2 ! 


-i- jf - ’ {a + 2hr'-^-.... 

Taking the interval of dilferencing to be b, the series may be written as 


= (i-£')"«"+! 

= (-!)« A"a"+i. 


FMAS 


8 
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Now 

^n+i ^^(n+i) a^n )^^ terms in 

,\ A= (w 4 -1)! -f In (n + 1 ) 6”"^^ n ! 

= (n 4- 1 )! (za 4* nb), 
and the sum of the series 

= H ~ + i) • (2^ + ^^)* 

Example 13. 

Sum the series 

^^4-(^4- i)^4-(Jc4-2)^4-... 4 -(:!c4-w~' i)^. 

The series may be written in the form 
5=(i4-iE4-i^^-f ...+£:^-'')^ 

— A~^x^ 

= A~^ (a; 4 - n)^ — A"^ 

= A~i [{x 4- w)(i) 4- 6 (ac 4 - «)( 2 ) + 6 (a: 4- n)^)] 
-A-4A?(i)4-6A[^(2)-b6^(3)] 

= J [(a!:4-«)^ {x-^n—iY — x- (a;— i)^], on reduction. 

Note, It is unnecessary to introduce any constant of integration: for 
if we put 

A“^ac^ = Wa.4-^, 

the constant disappears on operation by ~ i, 

13. “Summation w”, or [«]. 

An interesting example of the development of a series of opera¬ 
tions by the method of separation of symbols occurs in the theory 
of graduation. One of the objects of graduation is to obtain a 
smooth series of numbers instead of the rough series given by the 
actual data. A step to the solution of the problem consists in 
replacing each term of the series by the arithmetic mean of the 
n successive terms of which the given term is the central term. The 
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operation of summing these successive terms is generally denoted 
by [n] (“summation w**). For the present we will assume that 
n is odd. 


For example [5] Uq = u_2 + u_i + Wq + 


[w] Wq — W n-l n- 3 + ♦»« + 3 + Mrt-1. 

2 ~~ 2 2 2" 


Consider a simple summation: [3] Uq 
By definition [3] Uq = u_-^ + Wq + Wi, 

and if we write Vq for [3] Wq we may operate again on Vq to obtain 

[3] 

In that event we shall have 


[3] ^0= [3] «-i+ [3] «o+ [3] Wi 

= M _2 + 2 M_i + 3^0 + 2«1 + M2 

= Mo + (m_i + Mo + Ml) + (m_2 + U-l + Ko + «1 + “ 2 ) 

= [i]«o+[3]«o+[5]«o- 

If therefore we denote the double operation [ 3 ] [ 3 ] Mo by [ 3 ]^ Mo, 
we have the symbolic identity 

[3?sW + [3] + [S]- 

Similarly [ 5 ]^= [i] + [ 3 ] + [S] + [?] + [ 9 ]. 
and [«]^ = [i] + [ 3 ] + [ 5 ] + [ 7 ] + • • • + [ 2 w - 1 ]. 

where n is odd. 

The identity between [ 3 ]“ and [i] + [ 3 ] + [5j can be seen at 
once by writing down the terms in diagrammatic form: 

[3]* “0= [3] «-i+ [3] «o + [3] «1 
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These results are general. The sum of n consecutive odd opera¬ 
tions is [n] [r], where [r] is the middle operator. E.g. 

[5]+[7]+[9]+[I I]+[13] ^ [5] [9] - [9] [5]- 

14 . We can express [n\ in terms of the ordinary finite difference 
symbols thus: 

For a simple value of w, say 3, 

[ 3 ] Wo = W_i + Wo + Wi = 3 W 0 4- W_i - 2Uq + Wj 
= 3Wo + A2w_i 

= (3+^‘") Wo> 

where S“ is the symbol denoting the second central difference. 

Generally [w] = «H-8^ +terms in 8^ and higher differences 

^4 

if these exist. (Cf. Ex. 10.) 

The relations above are on the assumption that n is odd. If 
n be even we must find a meaning for the summation symbol. 

By analogy [2] Wq is the sum of two values of u whose suffixes 
are such that their sum is zero and their difference unity. 

I.e. [2] = + 

Hence [2]^ «„ = {[i] + [3]} 

= (4 + 82 ) Ho, 

so that [2] s 2 (I + 

= 2 (i+|82-j 1 ^ 8 ^+ ...), 

which is otherwise obtained by expressing w_j + wj in terms of 
central differences. This agrees with 

M = w+ ~ 82 
as far as third differences. 

The meaning of [n\ when n is even is now evident, and we need 
no longer restrict the values of n to odd integers. Thus the formula 

[w] Uq — W^n —1 + W 72 - 3 +...-}* Wn- 3 + Wn— 1 

2 ”2 '2 “2 

applies for any integral value of n whether odd or even. 
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*15. The following alternative form for exhibiting relations 
between the operators, whether odd or even, is due to Mr G. J. 
Lidstone: 



L [ ] 1^0* [4] I/| + [4] u_| +[4] ttj + [4] 

• [4] [41 



i i i i 


l«o“[3]“-| + [3]“-4 + l3]“| +[3]«* 

- [4] [3] 

The tracks marked-show the re-grouping. The diagram 

is of virtually the same form if we begin later: 



2[]«o»[ 33 [7l«o- 


16. By means of the relations already proved we may develop an 
unlimited number of formulae involving [n], [rn\, etc. 
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Example 14 . 

Prove that lo [i] — 3 [3] =2 [3] — [5] as far as third differences. 
{10 [i]-3 [3 ]}"o = ioM(,-3 ( 3 «o + |iS2Mo) 

= «o-3SX- 

Also {2 [3] - [5]} Mo = 2 ( 3 "o + -(5M0 + 

= “o-3SV 


Example 15 . 

Given [5] [5] z/q, [5] Wg, find ii^, fourth and higher differences 

being neglected. 

By Stirling's formula, 

+ A Wo . A 9 . - i) A^w.j + A^w^g . 

M, = Mo + * --5 + + .. ^--+ ..., 

Am. 1 + Amo ** . , a; - I) A®m_i + 

u.x = Uq-x ---+ — A2«_i-^ -i~--- +.... 

” 2 2 0 2 

Mj + m_j, = 2Mo+a;=A2«_i. 

[5] «0 = «-2 + M_J + Mo + Ml + Mj = (Mj + M_ 2 ) + (Mj + M_i) + Ug 
= 5Mo + sA2m_i. 


Similarly [5] m_b + [5] = ioMo + i3sA2«_i. 

Eliminating A^_i, we have 

125Mo = 27 [5] Mo-[5] M_5-[5] Mj. 

An interesting note by Mr D. C. Fraser on the properties of the 
operator [«] occurs in the Actuarial Students' Magazine, No. 3 
(Edinburgh, 1930). Here the general form for [w] Mj is given by 
Mr Fraser as 

n n 

E^-E 


which defines the operation whether n is odd or even; and the proof 
of the identity [3]* = [i] + [3] + [5] is made to depend on the development 
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of the operator E. Thus: 

“■'""'irrFsM"' 


(E^-E-i) (E-^ + E^+E^) 

(£i -Eri) + (£t - E-i) + (£§ - E-i) 

Ei-iri ’ 


= [i]«o + [3]wo + [5]«o- 

The general formula for [w]^ u can be proved in the same manner. 
Many further examples of the use of [«] will be found in various 
papers in Journal of the Institute of Actuaries. For simple extensions 
of the method see particularly Hardy, J.I.A. vol. xxxii, p. 371, and 
Todhunter, J.I.A. vol. xxxii, p. 378, and for generalizations, Lidstone, 
J.I.A. vol. LV, p. 177, and Aitken, J.I.A. vol. lx, p. 339. 

EXAMPLES 6 

Sum the following series: 

1. 7, 14, 19, 22, 23, 22, ... to w terms. 

2. 2, 12, 36, 80, 150, 252, ... to n terms. 

3. 10, 9, 7, 4, o, ~5, ... to 30 terms. 

4. 5, 10, 17, 28, 47, 82, ... to 20 terms. 

5. 10, 23, 60, 169, 494, ... to 7 terms. 

6. I, 2, 4, 8, 17, 40, 104, ... to w terms. 

7. I, o, ~ I, o, 7, 28, 79, ... to 2 k terms, 

8. 10, 14, 10, 6, ... to w terms. 

9. 125, 343, 729, 1331, 2197, ... to n terms. 

10. I, o, I, 8, 29, 80, 193, ... to 17 terms. 

Use the methods of finite differences to sum to n terms the series 
whose Ath terms are 

11. (jp + 3) (a; + 4) (:v + 5). 12 . x{x + 2){x + ^). 

13. (3JC-2)(3«+i)(3Jc + 4). 14. a:(ar+i)(a:+3). 
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l6. (2X -f 3) (2X + 5) (2X + 7) {2X H- 9). 

(^+i)(^ + 3)‘ 

I 

( 3 ^“ 2 ) (3^:+i)(3A?4-4)* 


22. Sum to n terms 2.4.8.14 + 4.6.10.16 + 6.8.12.18 + .... 

23. Obtain the general term of the series 

— -”- + — ~ ^ +... 

1.4.10 4.7.13 7.10.16 

and find the sum to n terms and to infinity. 

n 

24. Find S Uy.y where u^ —x {x2) {x 5). 

1 

25. The two series 6, 24, 60, 120, ... and o, o, o, 6, 24, 60, 120,... 
are given. Find the sum of n terms of each of the two series. Compare 
the results and explain the difference. 

26. Sum the series i, 5, 17, 53, 161, ... to w terms. 

27. Evaluate A~^ {x’^a^). 

28. Show that A (.x:!) = a; (^!) and hence sum to n terms the series 

1+3.21 + 7.3!+13.4!+ 21.5! + .... 

71 

29. Find S ttj, where m^ = (32:-i) (32:+ 2) (3*+ 5) (3*+ 8). 

1 

30. Sum to n terms i.3*^ + 3.5^+ 5.7^ + 7.9^ + .... 

31. Obtain the formula 

a+n —1 

S t<^ = n(i)«a + «,2,AMo + «(8)A*Ma + ... 

a 

and use the formula to find the sura of n terms of the series 
-8, -5, o, 14, 44,.... 

n 

32. Prove that S ( 4 (:®+1).*!=«.(« + !)!. 


15. :e(*+i)(* + 3)(* + 4). 

I 


17 

19. 

21. 


(* + 3)(* + 4)‘ 

X 


{x + 2){x + 2)lx + ^y 
x + 3 _ 

X ( a :+ i ) {x + 2y 
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33. Sum to n terms the series whose jjcth terms are (i) 0^ (3^ + 2), 
(ii) 2® (x^ + x) by finite integration. 


34. Evaluate E 




+ .... 


1 (( a ; 4-1) ( a :+ 2) 

35. Sum to infinity — -1- — -1- ^— - 

2.6.10 4.8.12 6.10.14 

36. Find the sum of n terms of the series 

2 2^ 2'^ 

37. Obtain the indefinite integral of (2;c— i) 3®. 

38. Show that the series 10, 24, 61, 163, 452, 1290, 3759, ... can be 
split up into two other series. Find the two series and hence sum the 
original series to n terms. 

39. Prove that A~^ [Wa.z;J = M3,A~%a; —A~^ and apply 

this formula to find the sum of the first n terms of the series whose rth 
term is (r+ i) 

wL - 1 


40. Evaluate 


[x (x-\~2) ( a ; + 3) (^ + 4)1 


41. Find the sum of the squares of the first n natural numbers by 
the method of finite integration. 


3 c 7 

42. Sum to n terms + ~—cf~z +1 


2.5.8 5.8.11 8.11.14 




13 'X ^ 7 

43. Find the sum of the infinite series — - 4 - 4- ^ 

2 2^ 2^ 

44. Find the sum of n terms of the series 

2.2 + 7.44-14.84-23.164-34.324-.... 

45. Prove that s' = + 


I —X 


X^ 


+ 7 


(AX-*"AX) + - 


Apply this formula to find the sum of the first n terms of the series 
whose rth term is r (r +1) x^~^. 


r xi 

46. Evaluate A“^ 2®.*.;-ri • 

^ I (2*+I)!_ 
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47. Use the method of finite integration to obtain the sum of n terms 
of the series i .3® + 3.S^ + S.7^ +.... 

48. Find the function whose first difference is ax^ + bx^ -{-cx + d, 

49. Prove that i^ + 2^:v + 3^:v“ +... is a recurring series, and find its 
scale of relation. 

50. If w,, is the «th term of the series i, 2, 3, 5, 8, 13, ... in which each 
term after the second is the sum of the two preceding terms, prove by 
the process of mathematical induction or otherwise that 

51. Show how the methods of finite differences can be employed to 
find the sum of a series of the form 

{a + ky + {a + 2ky + ,..-^{a + xky +.... 

52. Sum the series 

I! 2! n\ 

a-fi (<2+i) (tz-f 2) **’ (a+i) {a + 2) ... (« + ;/)* 

r=n 

53. Find the value of S n(r)r\ 

r— 1 

54. Prove that {2 [3] - [5]} - 3approximately. 

55. Obtain the approximate formula 

i25«o = [5]^ {wq +Aw_ 2“-Az/j}. (Woolhouse.) 

56. Prove that [w]^ — [my = {[n] — [w]} {[«] 4- [m]}. 

57. One of Hardy’s graduation formulae is 

{«o + Aa_2 -Amj} = Mo- 

120 

Prove that this is approximately true. 

58. Express {[3] + [s] “ [?]} «o in terms of u_^, u^, Mj, u^, and 

hence prove that 

[ 5 L^ 3 ] __ reproduces Uq to third differences, 

59. If = [5] prove King’s formula: 

Uq = -zwq - •oo8A2«;_5, 
fourth and higher differences being neglected. 
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MISCELLANEOUS THEOREMS 

1. In this chapter it is proposed to treat of certain propositions 
and applications of finite difference methods which are not essential 
to a first reading of the subject and which may conveniently be 
dealt with at a later stage. Some of the theorems are developments 
of familiar processes: others are alternative methods of approach 
for the solution of problems involving the principles of finite 
differences. 


DIFFERENCES OF ZERO 
2. If in the identical relation 
AV = (E'-i)^ 

= + —n(i) + i)"^ + W(2) + ... 

we put x — o^ we obtain 

- W(i) (n + W(2) (w - 2 )"^ ~. 

By continued application of this formula we can obtain values of 
/^n^m xvhen a: = o for all integral values of n and m. 

For example, if ;w = 3, 

[AVL^ = 2*-2.I» = 6, 

[AV]a^=33-3.2*+3.iS = 6. 

The values of [A'^x'^]^^q are known as “differences of zero’*, and 
in accordance with this definition the expression is often written 
as A^o’". 

It is evident that a table of values of differences of zero can be 
constructed if we can obtain a relation between corresponding 
values of A^o’”. 
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We have from the above 

l)"^ + W(2) (w —2)"‘— ... 

= n - (n - I)(1) (« - i)”»-i 4 - (n - i)(2) (w - 2)"*-^ - ...] 

= w [(i +n — 1 )"*“^ —(w — i)(i) (i + n~2)”*“^ 

+ («-i)( 2)(H-”-3)'"'‘^--"] 

= «[(£■- i)”-i 
= wA^^~^ i”»~^ 

= wA"“^ 

= wA"'-i (14-A) 0^-1 
= n (^A.n-iQm-1 

Alternatively, 

_ 2)"* — .... (fl) 

But, since 

An^m = (jc 4 - w)"^ — W(i) (;v 4 - n — i)"^ + W(2) (x + n — 2)"^ — ... 
we may put n—i for «, m — i for m and i for and obtain 

_ I^w-l ^ 2)'”“^ — ... 



= - A^^o^: 
n 

= «A’^~^i"*~i = w (A^-Iq^-i 4- A^o^^-i). 

This proof is given by de Morgan. 

Another method for obtaining this relation, depending upon the 
formula for the wth difference of the compound function u^v^y is 
given below (paragraph ii). 

3 . Since A'^x'^ —n ! for interval of differencing unity 

Ar*o^t =n!. 

Similarly, since 


i.e. 


A^+r^n ^ A^+*’0^ =0. 

^UQtn when n>m. 
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We can now build up a table of differences of zero by continued 
application of the relation given in paragraph i above. 

The table is 


m 

Ao"* 

A*o” 

A“o”* 


A®o”* 

ACo»» 

I 

I 

0 





2 

I 

2 

0 




3 

I 

6 

6 

0 



4 

I 

H 

36 

24 

0 


5 

I 

30 

150 

240 

120 

0 

6 

I 

62 

540 

1560 

1800 

720 


and so on. 


4. An interesting application of the use of the differences of zero 
for the calculation of the coefficients in an expansion is as follows. 
The fundamental formula 


can be written as 

^(2) ^(3) 

Ux = «0 + ^^0 + -Tf + -TT + • • • • 

2. 31 

^(2) v(3) 

x'^ = 6^^ + 1)Ao'” + — + —T A'^o"' -f.... 


2! 


. f 


By use of the relation 

A^o^'^ = n 1), i.e. -4- = , 

n\ n\ {n — i)\ 


a table of the coefficients in the expansion of x'^ in terms of succes¬ 
sive values of the factorial x^^^ can be written down in a similar 
manner to that given above. 


5. The differences of zero have many special properties, and they 
are used in higher work in the theory of series. A simple example 
involving their use is given below. 

Example 1 . 

If (i+x)^:=Cq + CiX+ n being a positive integer, find the 

value of / NO / NO / NO 

(M-i)2q4-(«-3)2r3 + (w~5)“ ^5 + .... 
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Now (* -1)** Cl + (a; - 3)® C3 + (AC - 5)** Cj + ... 

= (E~^Ci + E-^c^+E~^c^ +...) a :® 

= i [(i +£-i)"-(i a:® 

= + 2)" - A"] £-”a:® 

= H(A+2)'‘-A”](Ar-K)® 

= i [(A + 2)” — A"] o® when x = n 
— I [2"0® + K2 ”“^Ao® + in (« — I) 2”“® A®o® +...] 
= 4 [n2"“* 4-n (n — i) 2”“®] 

= 2"“® n (n + i) on simplification. 


RELATIONS BETWEEN THE OPERATORS 
D AND A 


6. It is shown in Part I (p. 65, paragraph 5) that 

f{x)+hfix)+f^r w+- 

converges to the limit f{x-^h), provided that Lt (a:) is zero, 

71-^ 00 

where 

Rn{x)^-,^ {x + eh). 


This is Taylor’s theorem for a convergent series, and we may write 
it in the following form: 


= Wx+ rhDu^ + Dhi^ + +. 


i+rhD+ D® + ^ D®+ 
2! 3! 




_ fiVliD, 




But, for interval of differencing A, 


«x+rft = j^“x* 
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SO that hD = log £ = log (i + A) 



_A2 A® 

A^ 


= A 


-+ ... 



2 3 ' 

4 


^ I 

r* A® 

A® 

A« 



«*+y “ar- 

-“x + 

4 


If= I, then (i-fA)^ = e^^ 

and we have a result analogous to the important relation in the theory 
of Compound Interest, namely = where S is the force of 

interest corresponding to a rate of interest 2. 

7. Since ^ = t -1- -h... , 

^1234 j 

_ I A 2 A* A^ 12 

JD 2 = 1 A-+-+ ... 

h-\_ 234 J 

= ^|-JA2-A»+AtA^-§A®+ ...]. 

Similarly, [A® - |A« + JA* -... 

We have therefore a convenient method for expressing the 
differential coefficients of a function of x in terms of the differences 
of the function. 


Example 2. 




T r// 


the number of 

(the force 

of mortality) 

= _ where L is 

Ldx * 

persons at exact age x in any year of time. 

Given the following table, 

find a value for 





Age .v .. 

50 

51 

52 

53 

4 

73499 

72,724 

71.753 

70.599 

The difference table is 




X 

h 

A/, 

A'4 


50 

73.499 

- 775 

—196 


SI 

72,724 

- 971 

13 

-183 

52 

71.753 

-1154 



53 

70,599 
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. ^ 

dx 


2 3 / 


since the interval of differencing is unity 

= ■”775 + 9S-f4*333*--» when :v = 50, 
= —672*667. 

r I di^ 672*667 

•** -tj^\ =-00915.... 


L 73,499 " 

Note. If A=f= I, must be taken to divide by the appropriate power 
of h in applying the formula. Thus, if is given for quinquennial 

I 

intervals of x^ ® will be -3 —-|A%j.4-...) and not simply 

dx 5 


8. The result 


AX-IAX + .... 


A2 A^ 
w.-f — u^- 

2^3 


can easily be obtained by the differentiation of the advancing 
difference formula for u^. 

Taking the simple case when A= i, 


W3, = Wq + + 


'(^-0 a 2„ , a’(x-i)(a:-2) 


AX+-. 


du^ . 2X-I 3 x ^- 6 a -’+2 

= Amo +- + -—-7-A*m„ +... 

(ix ” 2 “ 6 


/_ A „ _ 4. _ 


which becomes, on changing the origin to x, 
dx ^ 2 3 


Similarly, we can express in terms of central differences. 
For example, Stirling’s formula is 

Awn + Aw_i X (x^—i)A^u^i + A^u o 

U^ = Uq + X—^ -^ + --, A2z/„i + -A. - 1 -z:? ^ 

^ ” 2 2! 3! 2 
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Differentiating with respect to x: 
du^ 
dx 


^ ^ A» m_i + A =»«_2 ^ ^ 


•■• (&L— 


w 1 — 


as far as third differences 
[«2 - 3«1 + 3«o - «-i + «I - 3“o + 3“-i - “-al 


= - (mj — M_i) — - (2/2 — m_ 2) ori simplifying. 
3 

Changing the origin, we have 

. si/ S 

Tx ^ 3 “ l2 

A first approximation will evidently be 

dn.j,^ u.xn ~ u^-i 
dx z 

or, if the unit of differencing be A, 


zh 

This simple approximation can be seen quite easily from a con¬ 
sideration of the geometry of the figure. 
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Let P be the point whose coordinates are {Xy u^) and let Pj, Pg be 
the two points whose coordinates are (x-^hy and (x — hy 
respectively. 

Then if PK be the tangent at P 

duy. 

—," = tan a 


= ^ nearly 

P,Mi-LMi 

_ Kr 1 h ~Pjr- h 

zh 

9. Another formula, giving the differential coefficient in terms of 
central differences, can be obtained from Bessel’s formula. 

S) + * A%,., + AX 


(x-A).v(j;-i) 


A3«_3 


Changing xtox + ^y 

„ «0 + Wl , ,.A„ , 

„ = — + -- ,-- ---- 


'1 ^ 


Differentiating: 

S“'H-a 

If ;c = o, 


A2if„j + A%o (x^ I \ AO 

+ ■- +(7“24) 


d . A^w_i 


Changing the origin to a; — i we have the approximation 

An interesting discussion on the calculation of the values of differential 
coefficients of a function by means of selected values of the variable will 
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be found in T,F,A. vol. ix, pp. 238 et seq, (G. J. Lidstone). By means of 
tables of coefficients prepared for convenience of practical working 
Mr Lidstone evolves formulae for the values of the successive differential 
coefficients, both for advancing differences and central differences. 
In addition, alternative processes are given for the values of the deri¬ 
vatives when the intervals are unequal. 

*10. It should be borne in mind that, in choosing a formula for 
the expression of the differential coefficient of a function in terms 
of its differences, central difference formulae are to be preferred 
to advancing differences. These formulae possess smaller coeffi¬ 
cients and have the additional advantage that alternate coefficients 
vanish. Further, a greater degree of accuracy may be obtained by 
the use of a suitable central difference formula. 

Mr G. J. Lidstone has pointed out that if for any reason it is 
decided to use the relation 

considerable increase of accuracy can be secured by the simple 
and easily remembered device of ending with instead of 

^ We bring in for the purpose of obtaining omit 

otherwise use the formula as before, beginning with 
Aw^. This reverses the sign of the error and reduces it numerically 

in the ratio i: (;z4-1). Thus, ^ A'* — A’^+^ is replaced by 

n n 

It is assumed that A^+^w, though not negligible, is not varying 
greatly. The proportionate improvement increases with n, and if 
n is at all considerable, we attain approximately the accuracy of a 
further order of differences with no additional work. 

The same process may be employed for the second and third 
differential coefficients, after which the improvement, though 
present, is not proportionately so marked. 
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THE COMPOUND FUNCTION 

11 . We can adapt the principle of separation of symbols to the 
evaluation of such expressions as by a simple extension of 

the process of differencing. 

Let Ai, Ej, 2j, ... denote operations on alone and Ag, Sg, ... 
operations on alone. 

Then = 

= (^”1^2-1) 

^'“U^ = (£'1-1)« u^v^. 

By expressing in terms of A, and Ag we are enabled to 

obtain expressions for the expansion of 
First, if « = I, we have 

[(l + 4 ) (l +-^2)- 1] 

= (^1 + '^2 + ^i4)Wx^'x 

= (^l + ^ 2 'fc’l)«x^x 

~ T ^2 Ej Uj. Vj. 

= ^x‘^1«x + 'E^iMx-A2^x 
= ^x^1«x + «x+iA2^'x. 
or, dropping the suffixes, 

which is otherwise evident. 

Again, 

^"“x^x = ('^1 + ^iA2)” «x®x 

= (Ai” + W(i)Ai’’-^ A’j A2+n(2)Ai"-^ jE'j^A2^+ ...) Uj.v^, 

which is easily seen to be 

®x^X + ”(l)-^^'x‘^"~X+l + ''(2)^^x^”~®«x+2+ ••• +Mx+nAX- 
If in the above expression we put and v^=x, we have 

A» (jxni’-i . x) = xA”x”‘~^+nA"“^ (ic +1 
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THE COMPOUND FUNCTION 
Let x=^o\ then 

_ fif^n-X jm-1 — £6^-^ = fl -f A^O’"”^), 

which is the relation proved in paragraph 2. 


12. The ordinary formula for finite integration by parts is 

Y.ii^Vx = Ux^v,j^ — ^ [Aw^S«>^_^j]4-an arbitrary constant. 

In cases where the formula has to be applied more than once 
we may use the series found below. 

S = A~i= (A,+A2 + AjAo)-^ 

-[Ao + AiCn-A.)]-^ 

^A^-i [i+AiA^-Mi+Ao)]-^ 

= A^-i [i - A, A^-^ (1+ A2) + A.^A.-^ (i + A^)- -...] 

^ A^-^ - Ai A2~2 (i + A 2 ) + Ai^A^-^ (I + A.)^- -... 

- Ao-i-A iA2-2£’2 + Ai2Ao-‘‘^£’2^- .... 

On dropping the suffixes and inserting the appropriate functions, 
we have 

= + •••• («) 
By an alternative treatment another formula is obtained: 

SMx ‘^x = «x-l - ^"x-2 + A2 m^_ 3 X'\v - • • • • (*) 

As an example of the use of a series for 'Lu^Vj, we may prove the 
formula for ^a^Uj. given in Chap. VI, paragraph 10 . 


Example 3. 
Prove that 




or 

a— I 


aA 

+(^''Z'iy“(^_7)3+ -r 




a— I 

and so on. 

Let — Vy> in series (a). 


rr 

« , S3«*+2 


rCC+a 


{a-if 


(a-1)3’ 


S«a*+3= 


a®+3 

i^if' 
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Then 


= Uj. 2a® — Awa; S^a®^^ + A^w^* 2-‘^a®+2 — 2^a®+^ + ... 


“•'a-1 ^“*C/ 7 -T^ 2 +'^*“*r 


«x+2 


a®-^ 


,x-l 3 


^(a-i)2 


__ ^ A®2/__ 

(a-1)3 


a® 

a— I 


^ aA . a-^A^ 


a^A^ 


a-1 (a—i)^ («~i) 


, + ... r M, 


13. The following is a further illustration of the application of the 
above method. 


Example 4« 

Prove that 

X" {u^v^) = I.”v^ -«(1)A«^ S"' 1 + (n + i),2) A%, S” ^^ 

- (« + 2)(8) ^''+X+3 + • • •• 

Now Au^v^ = {A^ + A-^{i+A^'\u^v^. 

:. S" = A-" = [Aj + Ai (I + Aj)]-" = Ag-" (1+ Ai Ag- 17 -2)-", 

.-. SX«. = A.>-’' [i -«<i,A,Ag-i£'g + («+ i),g, Ai^Ag-^'iV-! 

- (” + 2)(3) Ai^Ag-^SgS + ...] K^t.^ 
= — n^i)Aug. + (w + i)(2 ) A^Mj. 

- (”+2)(3) AX 2 ;"+x +3+• • - 

If n = I we have 


AiWjZJj, — Ag ^ (i + A^Ag ^-/ig) ^ 


= Ag-Mi 


AiAg-ii'g 


I +A1A2-X 


^ Ag + AiAg' 

_ /^ A —1 


= - A-i (Aa* 

= a^A®*-A(Aa^So*+i), 


the ordinary formula for summation by parts. 

The subject of finite integration by parts is treated more fully in 
Chapter VIII. 
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FUNCTIONS OF TWO VARIABLES 

14 . When and y are independent variables, u,j.yy f{x,y), ... 
represent functions which assume different values according to 
the values of x and jy. An alternative notation is Uj..y or f{x:y). 
For example, the function 

a:- + 2xy +y'^-\-x + 3_y, 

in which x and y both vary, may be written shortly as either 
or u^-y. If jv is a function of jc, we may reduce Uj^y to the form 
and thus obtain a function depending on x alone. 

Now suppose that x is changed to x-\-h and that y is changed 
to 3; + ^ while X remains constant. Then the new value of the 
function is dependent on x-\-h and + It is not necessary for 
both X and y to vary: x may become x -f- h while y remains constant 
or vice versa. 

If the values of the function proceed by equidistant intervals, 
we have the following scheme: 


^^x:y 

^x+h:v 

r :y 

^^x+3h :y 

^x\y+k 

^x+h:yi-k 

^x^2h,y+k 

^x+3fi:y+k 

^x:vi-2k 


^x+2h\y^2k 

^x+3h:y+2k 

^x\y^-2k 

^x+h:y 

^X\-2h\y-'rZk 

^x+3h:v+2k 

or, if our 

origin be (0, 0) and 



^0:0 

^1:0 

«2:0 

^3;0 

^0:1 

Wi:i 

W2:l 

^3;l 

^0:2 

"1:2 

^^2:2 

«3:2 


15 . If we are to apply the processes of finite differences as hitherto 
defined we must distinguish between an increase in the value of x 
and an increase in the value of y, Wc therefore write to denote 
the operation of increasing the value of x: by a unit difference while 
y remains constant, and Ey similarly for 3; while x remains constant. 
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That is, £'^Mq.q = Mj.q and = 

so that A^Mo.o=Mi:o-Mo;o and \Uo,o = Uaa-Uo-.Q- 
Again, Aj^ ^0 :o ~ (Aj, .q) 

“ Aj, (Mo:1~*^0;o) 

• ~A^Wq;1 Aj.M0.q 

and Aj.*Aj,Wq.q = {u^)■l — Mo:o) 

= AxX:l-A>o.O 

= (« 2 :i - 2M, :i + Mo:i) - (M2;0 - 2 K, :o + Mo:o)- 

The general formula corresponding to the advancing difference 
formula for one independent variable is 

^m:n ~ d” A*)’" (l + A^)” 

= (i +W(i)Ax + m(2)A,.H...) (i +W(i)A^ + n(2)A/+...) Wo-o 
= (i + m(j) Ax + /«(2) Ax^ + W(.„ A/ -i-... 

+ «(!) Ay + W(, )«(i) Ax Ay + m(2, «(i) Ax^Ay +... 

+ «(2)Ay2 + ?«(,,n(2,AxAy2+... 

+ W( 3 )Ay®+...)Ho:o 

= «0;0 + ('«a)Ax + «(l)Ay) M0:o 

+ (»^2)Ax*+M2(i)n(i)AxAy + «(2)Ay2)?^0.o 

+ ('«(. 3 )Ax® + »M{2)%)Ax^Ay + W(j)«(2)AxAy2 + n(3)Ay») Mo:o+ •••• 

It often happens that a certain order of differences, say the rth, 
is sufficient for interpolation along the x-line, while a higher order 
is necessary for the y-line. This is especially so when x is a young 
age and y an older age (see p. 137, Ex. 5). In this case, the formula 
can be simplified by omitting all terms involving Ax'' "m whether 
standing alone or combined with values of AyW. For the same 
reason it may be desirable in tabulating to use smaller intervals 
for y than for x, according to the nature of the function. 
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16. Application of the formula. 

Example 5 . 

Given the following table of values of estimate the value of 

^23: 17 * 


X 

3^=15 

y — 20 

J' = 25 

20 

5-947 

4-418 

3-547 

25 

6-046 

4-530 


30 

6-144 




Here the interval of differencing is 5. Changing the origin to the 
point (o, o) and the unit to i, the data are given for the points (o, 0), 
(i, o), (2, o); (o, i), (o, 2) ; (i, i). The value required is w.6:.4. 

Differencing downwards for values of etc., we have 

Aa:Wo:0 = -099; -*OOI. 

Differencing across for values of AyWo:o» etc., 

^j/«o:o= -1-529; Aj,X:o = * 658 . 

Also A^A,,Wo:o=-^'i:]-^o:i-Wi:o + ^^o:o=-OI 3 - 

w.e :.4 = (I 4 - *6A^- • 12 A^" .•.) (i + * 4 ^v- *1 2A,^^ 

= (I 4 ‘6A^ 4 * 4 ^ 1 / - - 1 2A/ 4 •24A^ A^ - • 12a, 2 ) : 0 

= 5 - 3 i 9 - 

Note, Here it is not really necessary to use see end of 

paragraph 15, 

17. While for most purposes the formula given above is con¬ 
venient, special circumstances may arise in which other methods 
may be more suitable. Where the intervals are not equidistant ve 
may apply either a method of divided differences or one of various 
adaptations of Lagrange’s formula depending upon the number 
of points given. If, for example, four values of are given, 
namely then it is quite easy to show that 

„ -,J I ,, (y-a) 

^■v (a - /J) (tf - 6) “•*’ (a - 13) \b - a) 

-o.) lb-a) '5'“ (/3 -a) {a-by 

If more than four values are given the formula becomes un¬ 
wieldy. It is seldom necessary to interpolate except between 
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equidistant values of the function, and in that event a form of 
advancing or central difference series is generally preferable. 

Two-variable functions are of great frequency in actuarial work. 
Tables of annuity-values depending upon joint lives are often 

available for quinquennial values of x and y, and when values 
at ages other than those tabulated are required recourse must be 
had to methods of interpolation. Although the formulae given 
above are of general application special methods can be found to 
meet the requirements of the problem to be solved. 

P"or example, if quinquennial values of a^^.y are available, and if 
the two ages concerned are such that their sum is a multiple of 5, 
we may choose our origin and interval of differencing so that 
We have then, from the general formula for 

^x\l~x ~ ^0:0 ~ (:v — i) (Aj.^ — zA^. A^ + A^^)] Uq.^ , 

i.e. 

-X = Wo:l + (Wi:o - ^0:1) + ~ l) (^ 2:0 - + ^0:2). 

Again, if jc+y =2, this formula becomes 
^x:2-x — \x{x—l) U 2 :Q — X {x — z) Mj-i + \ {x—l){x~2) Wq.2> 
or, on changing the origin, 

==l(x-l)(x- 2 ) Wy-o -x{x- 2 ) Wi:_i + ^X (x - I) W 2 :- 2 > 

for which the data required are u ^. q , Wj and W2:-2- problem 
is thus reduced to a single variable interpolation. 

This second formula is very useful in practice. As a rule we can 
choose our data within wide limits, and it has been found that 
with certain functions the three-term formula gives as good 
approximations to the true results as do formulae involving higher 
orders of differences (see Spencer, J./.. 4 . vol. XL, pp. 293-301). 

The general second difference formula of which the above is a 
particular example is 

^x’.rx ~ WWq.q 4" WWj-f- PU2-^2r> 

and in the note referred to above, Spencer gives a table showing 
the application of this formula according as r takes the values 
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Another form of the formula for an interpolated value of when 
four values are given is — where 

X and y are both less than unity and =jy-f 7; = i. The second 
difference formula can be written as 

where and hy^u are second central differences with respect to x and 
y respectively and and are factors depending upon the values of 
X andjv (Buchanan, 1 \F.A. vol. x, pp. 329, 330). 

Example 6. 

Values of the joint-life annuity for quinquennial ages being avail¬ 
able, find a value for ^^41:51. 

(i) Take the origin at (40:50); then if the interval of differencing 
be 5 years, (44:51) will be represented by (-8:-2) and x~hy = i. 

= + ^ + (^^2:0 ~ ^ + ^0:3). 

The data required are 

^40 : 56 ~ ^40160 = ^‘ 9 ^^ ^45:50 ^^' 7^3 

«50 : 60= 10-202 ^45:55= 9*^54 

Then w.«:.2= 10-135 + -8 (10-763 ~ 10-135) 

+ 4 *8 (-’•2) (10-202-19-708 + 8-926) 

= 10-135+ -5024 +-0464 
= 10-684. 

(ii) Take the origin at (40:45) so that (44:51) will be (-8:1-2) and 
x-hy=z 2 , 

^x: 2 -x=i (^-2) Uo: 2 -X {x-z) + (^“l) "2:0- 

The three values required are 

^4o:65 = to-135, 

^45 : 60 ~ ^0*763, 

« 50 : 45 = 10-763. 

W.8:i*2 = i (~‘2)(-I*2) 10-135--8 (-1-2) 10-763 + ^ (•8)(“*2) 10-763 
= 10 - 688 . 

If nine values surrounding the point (44:51) be taken and a Lagrange 
formula for these nine values be used, the value for <244 :54 becomes 10-684. 

This nine-point formula is a safe formula for occasional interpolation, 
and by its use the risk and labour attaching to the calculation of differ¬ 
ences may be avoided. The formula is used centrally, the area of inter- 
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polation being as shown in the diagram below. The ordinary single¬ 
variable Lagrange interpolation formula is used to interpolate for x in 
each column, and the formula is used again to interpolate for y from the 
three calculated values. 

y -y 

X 

,00 o 


o 



o 




o 


o 


o 


This process is veiy^ simple with the aid of a machine, especially for 
quinquennial intervals, when the coefficients are as follows: 


Interval 

«-i 

«0 


“•4 

— *12 

+ *84 

+ •28 

— *2 

-•08 

+ •96 

+ •12 

+ •2 

+ *I2 

+ •96 

— *08 

H- *4 

4 * *28 

H- *84 

- *12 


18 . The above example shows that different degrees of accuracy 
may be obtained by choosing different sets of data on which to 
work. The general theory follows the same lines as that for single¬ 
variable interpolation. It will be remembered that the ordinary 
advancing difference formula may be applied to the expansion of 
Uj. in terms of the differences of u^, on the assumption that y = Uy. 
is a rational integral function of x. In these circumstances we may 
represent the function graphically, and the successive values of .v 
and y will be points on the plane curve y — u^.. When we are 
considering a function of two variables x and y wc assume similarly 
that we may represent z — as a surface. Now in Chapter V 
(paragraph 15) it was proved that the effect of including higher 
differences in the expansion for does not necessarily give better 
results than if they are neglected. In the same way it may be 
shown that by choosing more points on which to work we may 
exceptionally produce a result farther from the true value z on the 
surface ^ — ^han we should obtain by relying on fewer data. 

With regular data the formulae with y in the central area of the 
given points are usually preferable. In the space for which a:, y are 
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both positive and less than which includes all cases if the origin and 
direction of the axes are suitably chosen, a simple central difference 
formula is , , x 1 / v 

: 1/ = Wo: 0 + Ja: (Ml: 0 - W_1:0) + { u ^: 1 - Mo :.i). 

This is based on five points. 

This formula and the six-point formula consisting of the same terms 
with the addition of 

(M-1 :0 - 2«0:0 + : o) + 1 )’^ (“o: 1 “ 2 K„; o + «0 :-l) 

are probably the most useful interpolation formulae for ordinary 
actuarial purpovses (Todhunter, J.LA, vol. Lin, p. 89). In some cases 
either the first or the second additional terms may be omitted (see end 
of paragraph 15). 


* 19 . Taylor’s theorem for a function of a single variable, 

/(x+A)=/(^) + A/'(.v) + ^-,/"(x) + ..., 
may be written symbolically as 

/ (jc + A) = f {x) (see paragraph 6 above). 

d d 

Denoting as usual partial differentiation by we have 

/(x-h/i,y~hk)==e 

— e e ^^’f{x,y) 

= e ^^f{x,y) 


I'his formula is of theoretical interest. For use with tables the 
partial differential coefficients must be replaced by differences, so 
that in effect the formula repeats the difference formula already 
given. 

Thus, for quinquennial intervals, 

^ h l+k I 

(i-f AJ^(i4-Aj,)^-^ 
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so that ^ log (i +^x)+j log + = ^ 


[A^A,*+2AAA,A, + A;^A/] + .... 


INTERPOLATION FORMULAE: FRASER’S 
HEXAGON DIAGRAMS 

20. No demonstration of interpolation formulae would be com¬ 
plete without reference to Fraser’s graphic method. In this 
method the ordinary differences of a function of x are combined 
with the relation (a:4-i)(^) = .V(^)4-a:(^_i) in diagrammatic form so 
that by adopting certain conventions any finite difference formula 
can be written down immediately (P'raser, jfJ,A. vol. XLiii, pp. 
235 et seq.). 

We have (.v + /+i)(r) = (x + f)(,) + (A; + ir)(^_i), 

or (X + /+ l)(r) —(a; + 0(r) = ('^ + 0(r-l)- 

A relation similar to the fundamental finite difference identity 
^x+h~'^x —exists therefore between these coefficients. If we 
carry the analogy still further we can construct a table of values of 
(x-f-O(r) corresponding to a difference table. 

The tables are set down in reverse order thus: 


M_2 

Am_2 



(x + 2 )(2) 

('^ + 3 )( 3 ) 

M_1 

A2 m_2 


(*+i)(i) 


(x+2 )(3) 


Am_i 

A^W_2 

^(0) 

(■*^+0(2) 


«0 

A2m_i 


A^W_2 ^(1) 


(X+I)(3) 


Am„ 


(^- i)(o) 

^(2) 


“l 

AX 




■*(3) 


A«x 



(^-0(2) 

«2 
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If now these two tables be combined, we have the following 
scheme, where, by convention, jC(o), {x— i)(q) ... = i : 



where for any one of the hexagons we may write in general 


(» + 0(r+l) 





Now 

(x + i- i)(,) A''M..,+ (x+i)(r+i) 

= (x+i —l)(r) [A’‘M_; —A'‘m_;_^x] + A’‘+*M_^ [('*' + 0(r+l) 

-(•^ + ^-l)(r+l)] 

= (x+f -1) (,) [ - A-'+J«_(] + A''+i«_, (X + f - I) (,) 

= 0 . 

A relation is therefore established between the constituents of 
the various hexagons. If we make the following assumptions: 
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(i) the oblique lines denote multiplication and the horizontal lines 
addition; (ii) a line taken in a right-hand direction gives the product 
a positive sign, and in the opposite direction a negative sign, we 
can say that the sum of the operations performed in travelling round 
any hexagon is zero. It follows easily that the sum of the operations 
in travelling round any closed circuit is also zero. 

It is evident from a consideration of the diagram that if we travel 
from any value of (^ + 0a-) to any difference the result will 
be the same whatever route be taken. For example, from (x: — i)(o> 
through Wq, X(i), Au^^, X(2) to A^Uq and (by completing the hexagons) 
back along (^-i)(2), Awj, (a;-2)(i), (^-2)(o), to (^~i)(o> 

again enables the following identity to be established: 

{x- i)(o)Wo + ^(i) Awo + X(2) A^Uq-{x- i)(2) AhiQ-{x-2)(^) Awi 

-(^-2)(o)M2 + (^-2)(o)«1-(^-i)(0)«1 = 0. 

Re-writing this, we have 

Uq-\-xAuq-}-^ x i) — \ (x- i) (^ — 2) A^Uq 
or -(x-2) + = 

Wq- h xjAwq +-iv (a:— i) A“W(, = W2+(^*-2) AwiH-| {x — z ) (a:— i) A^w.^. 

Exactly the same result will be obtained by proceeding along an 
alternative route 

(^X l)(o)> ^(l)> ^(2)> ^*^^0 

and back through 

(a:-i)(2 ), Aui, (x'-i)^). 

The identity will be 

-(x-l)(o)Wi = 0, 

or xAu^^ +\x (x— 1) A^u^^=\ {x— i) {x — z) A-Wq 

i) Awi + z/j; 

i.e. =1 {x— i) (x-z) A-Uq-{-{x — z) Au^+ti^y 

the same result as before. 

21. Application of the hexagon diagram. 

The above example gives a formula for Wg in terms of Wq, Auqj Ahi^^ 
and Auiy and if we put a: = 2 we have a well-known identity. A 
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similar process will give a formula for , and since we may take 
various routes a number of different expansions of will arise, all 
giving exact expressions for u„ . It should be further observed 
that when an nth difference has been reached by travelling along 
the upper route the terms other than u,^ in the lower route will be 
zero, and it follows that by travelling round any circuit we obtain 
expressions involving an initial term and terms of lower degree 
than n. This is seen to be so by considering all the coefficients 
along the lower route will contain (x — n) as a factor and will 
therefore vanish when x — n. 

We have therefore from the diagram the following expansions: 

(i) w^ = 2 /o 4 -n(i)Awo + «(2)A^z/o4-W(3)A®Wo+••• (Newton’s formula). 

(ii) u„ = Uo + n^^^AuQ + n^2)A^u_J + {n + l)^gyA^u_J 

+ («+!) (4)A'*w_ 2+••• (Gauss’s forward formula). 

(iii) w,, = + n(j) Aw_i + (w + i) (2) + (w + i) (3) + • • • 

(Gauss’s backward formula). 

(iv) = Wj + (n- i)(i)Awo + ^(2)A^ + «(3)A^w_i 

+ (W+ i)( 4 )A'*W_ 2 + .... 

The mean of (ii) and (iii) gives Stirling’s formula, and the mean of 
(iii) and (iv)can be arranged to give either Bessel’s or Everett’s form. 

22. Mr Fraser has shown that the hexagon diagram can be used 
for divided differences in a similar manner to that given above for 
differences at equal intervals. (D. C. Fraser, Newton and Inter¬ 
polation —a Memorial volume issued by the Mathematical Associa¬ 
tion, 1927.) 

A typical hexagon involving divided differences is 



10 


FM A S 
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and the construction of the system follows closely the lines of that 
for ordinary differences. 

The A diagram is a special case of the A diagram, since ordinary 
differences are themselves divided differences when the given 
values correspond to successive integral values of the argument. 
The results in the preceding paragraph could therefore be obtained 
by a consideration of the hexagon diagram for A, the modifications 
necessary to produce the simpler forms of the central difference 
formulae being introduced at the last stage. 

OTHER SYMBOLS OF OPERATION 

* 23 . It has already been shown when considering the common 
operations of finite differences, Aw, Fw, Sz/, that the symbols 
denoting the operations can, within limits, be treated as obeying 
the ordinary algebraic laws. By omitting the function u the various 
processes can be applied to the operators alone, with a resultant 
simplification of procedure. For example, the method can be 
adapted to the needs of the infinitesimal calculus, and Dr Aitken’s 
introduction of his 6 operator has produced an interpolation 
formula of extreme generality (see Chap. VIII). Other operators 
have been devised for special purposes connected with finite 
differences, and an interesting example is given below. 

* 24 . The operator V. 

In Chapter IV attention was drawn to certain symbols of opera¬ 
tion which may be considered as supplementary to the A and E 
which are the basic operators in finite differences. These symbols 
—namely 8 and /x —may also be assumed to follow the normal 
algebraic laws (with the usual limitations), and the method of 
separation of symbols may be applied to them equally with A, E 
and S. A further symbol has been introduced connecting with 
the next lower value instead of with the more usual value 
This symbol is V, and is defined as — 

Corresponding to Au^ = (£* — i) , 

we have therefore Vw^ = (i — E~^) . 
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Thus, for example, 

In addition to the familiar 
there is a similar relation 

- m) _ -m+1) ^ 

where ™( — i)"^ ( —and not the inverse factorial defined 
on p. 19; and if we denote the product 

X {x\n — i) {xhi — 2) ... (x— \n + i) 
by x^^^^ it is easy to show that 

(Stefiensen, Interpolation, pp. 8, 9). 

No new principle is involved in dealing with these further 
symbols of operation; their introduction simply enables us to 
develop expansions and to write down formulae for interpolation 
with a minimum of labour. 


OSCULATORY INTERPOLATION 


*25. It may happen that we know the values of at intervals of a 
unit, and that we wish to calculate a complete table of values with 
smaller intervals. For example it is a common practice to calculate 
every fifth value in a life-table, and to complete the table by 
interpolation: here the unit interval for the preliminary calculations 
is five years. 

If we decide to use a third difference formula then every inter¬ 
polation involves four of the given values. For the interval o to i 
the best course is to base the formula on the four values w_i, Wq, 
Wj, thus giving equal weight to values on either side of the 
interval. An appropriate formula is Bessel’s formula, namely, 


«x=i K+«l) + (^ - i) ^«0 + 


3! 


to-s 
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This is the formula for the interval o to i. For the interval i to 2 
we should use the corresponding formula based on the four given 
values tto, z/j, Wg, %• 

Thus for the two intervals o to i and i to 2, the two interpolation 
curves have a common ordinate when x=i; they may not 
necessarily have a common tangent. Two neighbouring interpola¬ 
tion curves will usually cut one another at the point of junction, 
and while there will be a smooth run of values within each unit 
interval the values will not run smoothly with those in the next 
interval. 

We are led therefore to enquire whether we can find a series of 
curves of interpolation which shall have common tangents as well 
as common ordinates at the points of junction. Such curves are 
said to have contact of the first order at the points where they join, 
and the necessary condition for this contact is evidently that 

and must have the same values at these points on the one curve 

as on the other. 

The interpolation curve given by Bessel’s formula (i) for the 
interval o to i was based on the conditions that it should have 
w_i, Wo> ^2 ordinates. We retain the condition that Uq and 
should be ordinates and abandon the condition that and Wg 
should be ordinates. Instead, we shall stipulate that the diflPer- 
ential coefficients of when x — o and when x — i have known 
values. To fix these values we shall proceed as follows: 

The interpolation curves for the two intervals — i to o and o to i 
are to have the same tangent when x = o as the curve of second 
degree which has Uq and Ui for ordinates. The equation to this 
curve may be written 

i^^(-i, o, 1) = ^ (uq + Ui)-\~{x- 1 ) Au^-i-^x {x-i)A^u_^. ...(ii) 

The interpolation curves for the two intervals o to i and i to 2 
are to have the same tangent when xi = i as the curve of the second 
degree which has Uq, % and ttg ordinates. In a similar manner 
to the above the equation to this curve may be written 

F^{o, I, 2 ) = |(wo + Wi) 4 -(a?-|) AWo+i:x:(:v~i) AV •••(!“) 
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It will be noticed that the forms which we have chosen for the 
equations (ii) and (iii) differ only in the last term. 

Differentiating (ii), 


so that Fq ( — 1, o, i) = Awo —.(iv) 

Differentiating (iii), 

F^' (o, i, 2) = Awo + (x- 1 ) AX» 

and F^ (o, 1,2) = Aw^ + I-A^Wq. .(v) 


The values of (—1,0, i) and F/ (o, i, 2) in (iv) and (v) are 
to be values of the differential coefficients of the required inter¬ 
polation curve for the interval o to i. 

Let be that curve, so that its ordinates when — i, o, 1,2 
form the basis for an ordinary interpolation formula of the third 
degree: 


^ 1) + (- 1 ) ^ ^0 + ' 


(.T — i) A^t^Q + 

2! 2 




The tangents to this curve are given by the equation 
vj =^v^ + {x-l) -J ^ -^ A»z;_i. 


.(vi) 

(vii) 


The conditions to be satisfied are: 


«'o = “o. 


Z'l = Ml, 

®o' = ^0' ( -1 . o. I) =^“o - . 

(o> 2) = AMo + |A%io- .(viii) 


We have at once i (z^o + ®i) —M“o+“i)> these 

determine the first and second terms in (vi). 


r? / 'A lA% + A2z>_, I 

From (vii), = Az;o - --+ — A3z;_i, 


2 







ISO 

and therefore 
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^0 + ®i' = 2 A®o+, 

- ^o' = i (A%+A*®_j). 

But, from (viii), 

®o' + = 2Att(, + ^A®m_i , 

^i'-V = i (AX + A®m_i). 

Comparing the two expressions for Vq -^-v^ we have at once 
A^zLi = 3A®w_i (since A?;o = Az/o), 
and from the two expressions for 


i (A%o + A2z;_j)=:^ (AX + A2w_;j). 

We have now found the values of all four terms of the formula 
(vi), and we can write the formula in terms of u's as follows: 

1 / X / IX A X (x- i) 

= I (iio + «i) + (^ - i) Amo + —-^- 


I x (x-i) (x--t) 

2 ^ 


.(ix) 


This result is a formula of osculatory interpolation y and differs 
from the ordinary central difference formula (i) only in the last 
term. 

The difference is 


. -y Q x - I) ^x(a: -l) { 2 X- 1 ) 

3 ~ 6 


A®m. 


- 1 * 


*26. The problem of osculatory interpolation has been a fruitful 
source of discussion by eminent actuarial authorities. The method 
was devised by Dr Sprague (see J.LA, vol. xxii, p. 270) and was 
subsequently developed by Prof. Karup, Mr George King and 
Dr Buchanan. An elementary demonstration of the method, de¬ 
pending upon advancing differences, is given by Mr King in the 
Supplement to the y^th Annual Report of the Registrar-General, 
The following modification of Sprague’s method, using King’s 
approach, has been suggested by Mr P. G. Neal. 
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Consider the four values of u^: Uq, and Wg- 

The second degree curve through , Uq and Wj is 

/ S A ( 1 + ^ A O 

4- (1 + ix:) Az/_i -i- ^ 

. X ( a ;— i) . „ 

= Wo + ^vAwo 4- A2w_j; 

w^' = Amo4-^A2w_j —|A^w_i 

so that [Ux]x-Q = Aw-o-iA" w_ 3. .(i) 

The second degree curve through Wq, and is 

. a: (^- i) .0 
= Wo + xAwo 4- -- ~ —A“//o; 

/. w^' = Awo + xA^Wo — 2A^Wo 

[<]x-o = Awo4-iA2wo* .(2) 

Let the third degree curve through w_i, Wo, w^ and Wg be 
w^ = ?/o + 4- 4- cx^. 

Then uj = w + zhx 4- 3 cx“, 


a-\-h-\‘C = u^~UQ^ Awo- .(3) 

From (i) w = Az/o —'gA^^-i* .(4) 

From (2) w 4 - 264 * 3 r = Awo 4 -|A^Wo. .(5) 

Whence, solving equations (3), (4) and (5), 
w = Awo - iAX + i-A^W-j, 

6 = iA^Wo —A%_j, 
c ~ ^A^w_i. 


/. w^ = Wo + X (Awo ~ ^A^Wo 4- iA%_i) + (lA^Wo - A®w_.j) 4- x^.]A^w_i 

= Wo + xAwo-f |x (x— i) A2 wo 4-ix (x— i) (x— i) lS?u_i 
(where, it should be noted, the last term replaces 

AX 

in the ordinary advancing difference formula). 
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Putting At/o = Aw_i-f A2 m_i and A^w^ — we obtain 

easily ^ ^ xAz/_i 4-1 + x) ^ 

^i-fX = Wi + ^cAmq + i (x^ + x) A^w^ “ ^ — :x:^) A^Wo» 

which is the form given by King. 


*27. The proof in paragraph 25 above is due to Mr D. C. Fraser, 
and depends on BesseFs formula. The ordinary interpolation 
formula ending with the term A^m_j can be written in many 
different forms, all giving identical results, and the addition of 
jc (.x:— i) (zx— i) 


the expression 


A^w_i produces an osculatory 


formula. 

Suppose, for example, that we take the descending difference 
formula 


, , . A;(jC+i) x(x-—l) .0 

w^ = w_i4-(.x:+i) Aw_i + ---~^^—- A-?/_,+“ ^ A‘^2/_i. 


Adding the term ^ A^m_i we have 

Vj. — u_i-]-{x-\- 1) Aw_i + :v {x+ i) A‘^w,, 


l) ^ (.V—l) (2X—l) 

I 2 6 




. X {^-f i) . ^ (x- 1) 

= Wq + JcAw„i H^ -- A^?/_i H- 


Ahi_ 


which is the form obtained in Lidstone’s proof appended to 
Dr Buchanan’s paper, J.LA, vol. XLii, p. 394. 

An interesting Note on the application of a graphic scheme to for¬ 
mulae of osculatory interpolation appears in the Actuarial Students* 
Magazine, No. 3 (Edinburgh, 1930). By treating osculatory interpolation 
as a particular case of divided differences, Mr Fraser shows that a 
diagram similar to the hexagon diagram for ordinary differences can be 
employed to obtain the various forms of osculatory interpolation formulae. 


*28. Modern writers, notably American and Scandinavian 
actuaries, have of recent years turned their attention to the 
construction of formulae of osculatory interpolation. In a paper 
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read by Dr Buchanan before the Faculty of Actuaries in 1929 there 
is an extremely interesting account of the development of the 
subject up to that date (T.F.A, vol. xii, pp. 117-56 and 277-82). 
Of the numerous formulae given in that paper those of Mr R. 
Henderson and Mr W. A. Jenkins are of particular interest. 
Henderson’s formula is 

+ 6^ (I^ - I) (SX - iSX) 

+ XUj + - I) - jS\), 

where i—i—x and the 8’s are central difference operators in 
Sheppard’s notation. 

It will be seen that this is Everett’s formula with fourth difference 
adjustments. 

Jenkins’s formulae are also based on Everett’s formula. His 
earlier one is 

= I) (^- I) SX 

-hXUi-h^X (x- 1) SX> 

where, as above, ^ — i —x; while his ‘‘modified” osculatory inter¬ 
polation formula (which, in many respects, possesses advantages 
over existing formulae) is 

Mx = + S-Mo- 

+ xui + ^x (x“ — I) 8X ” • 

It should be noted that, in general, Henderson’s and Jenkins’s 
curves do not, as do Sprague’s and King’s, pass through the 
given points. They thus involve an element of adjustment as well 
as interpolation. 

The student who wishes to pursue the subject further would be 
well advised to read this paper, with special reference to Buchanan’s 
demonstration of Jenkins’s modified formula ^nd the derivation of 
the formula from first principles given by Mr D. C. Fraser in the 
discussion which followed the paper. 

A comprehensive study of the methods adopted by the latest writers 
on Osculatory interpolation will be found in an illuminating article by 
Mr J. E. Kerrich in JJ.A, vol. lxvi (pp. 88-124). methods of 
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American and English actuaries differ from those of the Scandinavian 
authorities, and in his paper Mr Kerrich discusses the differences 
between these methods and gives a general procedure for the derivation 
of the various formulae that have been evolved. 


EXAMPLES 7 

1. Show that = o or «! according as w > or — n. 

Hence prove that if n and r are positive integers 

rf -(w - i)^ + n(2> (” ~ ~ ”(3) (w — 3)^ + . ■ ■ = o or nI 

according as r< or =«. 

2. Use differences of zero to find (2-75)^ and (- 

3. Prove that (w + i) A”o^^ 

4. Prove the identity 

?l\ 

A^‘0’^ + «(i)+ «(2)4,... + ----. 

5. Show that (« 4-1) A”o” = 2 [A”~^o” -f A”o^']. 

6. If h'^Uy h'^Uy ... represent differences for intervals of ijm and 
Az/, A‘^z/, A®«, ... differences for unit intervals, then if fifth differences 
are constant, prove that 

_A^o^ A^Wy lowA^o^-A^o''' A^Wq 
4! ' 5! 

7. Show that / {E^) o"* = tC^f (E) o’". 

8. If jr be any quantity less than unity, prove that the limit of the 
series i"4*2"x4-3".x:^H-... to infinity 

= T—• -“Tg Iao" -f —^ A^o" + ( - A^o” 4-... I. 

(i —xy ( I —X ^ I ~x/ I 

9. Prove that the differential coefficient of f (n) with respect to n is 
approximately equal to 

f {/ (^ + I) - I)} - {/ (« + 2) -f (n ~ 2)}. 

10. Find the first three differential coefficients of ^Xy when a; = 50, 
given the following cube roots: 

i^50=:3-684o; 1^51=37084; ^^52 = 37325; ^53 = 375^3; 

'^^54 = 37798; '^55 = 3*8030; Ay56 = 3'8259- 
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11. Ue = i - 5 s 6 , W7= 1*690, ^9 = 1*908, Wi2 = 2*i58. 

Find the value of when a: = 8, by using divided differences. 

12. Prove that 

~ ^ (^x+h ^x—h) (^05-1 2h ~~ ^x~2?i) i^x+3h ““ ^x~2?t) 

13. The first differences of the first differential coefficient of log 
are in geometrical progression. Determine the form of 

14. Show that = AY (^ — f) approximately. 

By considering the function f{x) = a + hx + c^ and using the above 
relation, prove that log c = c'^ — r~^ approximately, where r is a small 
quantity. 

15. Given that = ^2 = 57 > value of 

is 4 when x = o and 72 when x = 2 , find the values of A^'^u^) and 

T 1 

16. Show that (I 4- log EY 0^ = -, when r > m, 

^ ^ ^ (r —w)! 

17. If Bu^ = prove that 

dx ^24 640 

18. Prove that 

+ A 2 w ^_3 

19. Given 

*^ 20 : 16~^*®^4 ^ 20 : 20 ~ 4 ’ 3^4 ^20: 25 ” 3*325 

^25:16 ”^’^29 ^ 25 : 20 ~ 4 ’ 34 ^ 

W 3 o.i 6 = 6 *o 75 

find W23 :i 7 2S accurately as possible. 

20. Obtain a Lagrange formula for given Wo:o> ^1:0^ 

21. Show that if tto:i = ^i:o t^o:2 = ^2:o» ^^en 

^ + v-i 


Ux:v^Uo:o + {x+y) 

22 . ^40:45= 13*133 

«50:40= 11*^98 

Find ^ 42 : 43 * 


A,4-- 


^ 40 :so— I2‘450 

<* 45:46 =12-432 


1 «o:o + *y K:i-«2:o]* 

<*45:40 = i2'88o 
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23. Given the following premiums for endowment assurances, obtain 
as accurately as possible the premium for age 23, term 17 years: 


Age 

r 

Term 15 years 

20 

5*947 

25 

6-046 

30 

6-144 


24. Find W27:34» given 

^20:20 3 * ^ 

«20:25 = 3-2625 

^20:30 = 3*5042 
^20:35 = 3*^45^ 

25 = 3*5000 


Premiums 

_A__ 

Term 20 years Term 25 years 

4*418 3*547 

4*530 


^25 :30 —3*^875 

" 25:35 = 3 *9542 
^^30:30 = 3*9333 
^'30:35 = 4*1417 
%5:35 = 4*5500 


25. The following values of/ (x^y) are given: 

/( 35 . 55) = 10 - 020 . /( 35 . 50)= 11-196, /(35, 45) = 12-019, 
/(40, 5 S)= 9-796, /(40, 50) = 10-894, /(40, 45) = 11-641, 
/( 45 -SS)= 9-583. /( 45 . 50)= 10-591, /( 45 . 45)= 11-243- 

(i) Using only six of the above values, find / (42, 52). 

(ii) Making use of all the data, calculate /(44, 51). 


26. Prove that A can be expressed in either of the two fol¬ 

lowing forms: 

(a) Aw^ -f u^^iW^Av^ + v^vi^Au^. 

{b) Awa-Az^a-Azt^a. H- + two similar terms 

-f Uj^Av^Aw^ -f two similar terms. 

^27. If V/(a:)=/W-/(a;~i), 

prove that 

V”/(x)=/(a:)-M(i,/(*-l) + «,2,/(a:-2)-...+(-i)"/(x-«). 

“*^28. Prove that 

j)V + (jc+i)(i., V24-(a: + 2)(3, V3 + ... 

4-(«4-n)(„+i) V"+i4-...]«o- 
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29. If 

prove that #"1 = {x^-i’‘){x^- 2 ^)... {x^- 

and x[“«+« = x (^®-i) (*“-|) ••• [x^-l (2«-1)1- 

30. Show that 

rl’l , 

/(*)=/(o)+-, 8/(0)+ 8y(o) + ... + --j 8’*/(o) + . 

where xl”! has the same meaning as in Qu. 29, and 



CHAPTER VIII 


MODERN EXTENSIONS AND SPECIAL 
DEVICES 

*1. In this chapter we shall deal briefly with certain modern 
methods of interpolation and summation. The proofs and applica¬ 
tions are not intended to be exhaustive, and for further information 
on any of the theorems or methods a study of the original papers 
or notes is recommended. 

AITKEN’S THEOREM FOR POLYNOMIAL 
INTERPOLATION 

*2. Dr A. C. Aitken has devised a remarkable general theorem for 
polynomial interpolation which covers a very wide field, embracing 
all the ordinary formulae as particular cases. The basis of the 
theorem is that when a difference operator is applied to a given 
polynomial of the wth degree in x, the result is a polynomial of 
the (w— i)th degree, and when performed on a constant the result 
is zero. 'Phis obviously holds when the operator is A. Similarly, 
when is differentiated with respect to x, a polynomial of one 
degree less in x is produced; and if ^ is a constant, then dAjdx 
is o. In this case the operator is D. 

The general operators are d and 0. 0 denotes that form of 
inverse operation which produces a polynomial divisible by x^ 
so that constants (of integration, etc.) do not enter into the inverse 
processes, which are therefore perfectly definite. 

It is evident, in the first place, that whatever operation is repre¬ 
sented by 8 , 01 is Xy and the effect of operating with 0, or a succes¬ 
sion of 0’s, gives a multiple of x. 

From the definition, since Uq is the constant part of which 
is removed by the operation 9 , 


so that 
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If for example 0 = A and the interval of differencing is unity, then 

01 — 0 ^(l)~^( 2 )> 0 <^( 2 )”‘^(3) • • • 

and generally 0‘^(n-i) = ‘^(n)* 

Again, since A (a^- i)(„) = (^- 

© (X-(X- (X- l)(,,) ± I, 

according as n is odd or even. 

Similarly, if 0 = Z>, it can easily be shown that 

0 =06> ~ ^ ^ 

(w—i)! n\ n\ 

*3. The general formula for determining the polynomial (:c), 
being given values when x — o of u^, and of all such results as 
■■■Or... 0 .^diU^, is 

= W() + ©1 ^ + (^2^1 ^o) ©] ©2 I + • • • 

+ ... ft>^,Z/o) 0102 ••• ©7T> 

where the expressions in brackets are numerical values, and the 
expressions following them, such as 0xi, 0i©2i» •••> are functions 
of X, It will be understood that, in each term, the first operator is 
the one that stands next to the operand w or i, and that the re¬ 
maining operators are taken in their order from right to left; in 
this order the subscripts of 9 increase and those of 0 decrease. 

It should be noted also that the 9 's are not necessarily all the same 
operation, nor need they be commutative. The following proof is 
due substantially to Mr G. J. Lidstone. 

First, let be a cubic in x and = ax'^ + bx^ + cx + d. 

Then since is of the third degree, is of the second degree, 
O^OiUx is of the first degree, and is a constant. 

Now 

Hence, putting O-^Ux for 7/^, 02^2 ©i^i similarly, 
e^dj^Ux = B^diUQ -f- Q^dsBzdi Uxy 

O^B.^B-^Ux =^302^1*^0 ^ constant. 
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There is no term in 0 in the last identity because the process 
has been continued until x disappears. 

From the last line above, operating with ©3, 

^3 ^2 ^1 “ ®3 ^3 ^2 ^0 

and since 0^62^6 is constant we may write this line as 

©3 6^3 do 61 Uj. = (f4 Oo Wo) ©3 I. 

Substitute this result in the third line: 

do = do Mo + (^3 ^2 ^o) I • 

Operate on this with ©3 and substitute in the second line: 

d^U^ = Wo + (^2^1 ^)) ®2 I + (^ 3 ^ 2 ^lWo) ©2 ©3 ^ • 

Finally, operate on this with ©^ and substitute in the first line: 

= ^0 + (^1^0) ^ I + Wo) ©2 ©31' 

This process can evidently be extended to the general case, so 
that we arrive at the general formula: 

= Wo + (^1 Wo) ©11 + (<?2^1 Wo) ©1 © 214 -... 

4 {^71 • • • ^2^1 Wo) ©1 ©2 • • • ©n ^> 

which is Aitken’s formula. 

It will be observed that the proof is exactly analogous to that 
of the divided difference formula (Chap. Ill, p. 44). 

* 4 . By replacing the d's by more familiar operators various well- 
known formulae can easily be obtained: 

(i) If ^1 = ^2-^3= ••• =^> shown above, 0 xI=a:(i), 

0^021 —x^o) and the formula in paragraph 3 becomes 

M^ = Mo 4 AuqX ^^) + A 2 woX( 2 ) 4 AX‘^( 3 ) 4 •. • 
which is the ordinary advancing difference formula. 

(ii) If the d's are all equivalent to the operator Z), we have 

x^ x^ 

^ u^ = Uo+DuuX+D^Uo—.+D^u^ — + ... 

2 , 3 . 

which is Maclaurin’s series. 
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(iii) Let OiU^ — {u^ — Ua)l{x —a) so that ^i= A, ^1^2= A^ and so 

a ha 

on; then it can easily be shown that the formula repro¬ 
duces 

+ A«a + (<^-^) {x-b) AXr + -*-> 

b be 

Newton’s divided difference formula. 

* 5 . These paragraphs are based on the note by Dr Aitken in 
y.LA. vol. LXI, pp. 107 et seq. In the same volume appears a 
note on Lidstone’s extension of the theorem to interpolation in 
Everett’s form. In thus extending Aitken’s theorem, Mr Lidstone 
has introduced an operator, A, which reduces the degree of a 
polynomial by 2, and not by i, as in the general form, and the 
corresponding inverse operator A=A”^. This ingenious modifica¬ 
tion results in some very interesting identities and renders the 
development of Everett’s series a comparatively simple matter. 
{Ibid. pp. 113-16 and the original paper cited therein.) 

The student who desires further information on the subject of 
Aitken’s formula is advised to read Dr Aitken’s original paper, 
Proc. Edin. Math. Soc., Series ii, vol. i (1929), pp. 203 et seq. 
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*6. A formula for the finite integration of a function which is 
the product of two other functions is 


An extension of the formula (Chap. VII, paragraph 12) is 

+ ^^«xS^^x+2 -.(«) 

If is a polynomial of the wth degree in x^ differences of a 
higher order than n will vanish, and the last term in the series 
above will be . , 


The series can however be terminated at any stage with a 
remainder term. 


FM A S 


II 
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For example, 

so that = A 

or = .(*) 

If in place of u^, and we write —Au^ and respectively, 

we have 

-2 (Am^2«;^+i)= + (A^m^S^^^+j). 

Identity (b) then becomes 

^Ux^x = - Aw^S^sy^+i + 2 .(0 

By substituting A^u^ and respectively in (c) 

it can easily be seen that the remainder term after (— i)^*^ 

be ^ ^ (A»+XS»+%.+„,,). 

(See G. J. Lidstone, J.l.A, vol. LXiv, pp. i6o et seq.) 

*7. If in formula {a) we replace Uj. by Au^ and Vj, by i, and insert 
the limits x and o, we have 

X 

2 (Am^. l) = AMjX(i)-AX(-r+l)(2) + '^®«x('^ + 2)(3)---- 
0 

But S {AiiJ) = - C, where C is a constant 

= u^-UQy say; 

- Wo = *(1) Am^ - (JC + l) (2) AX + (x + 2 ) (3) AX - • • • 
or «o = «x - ‘*( 1 ) Am* + («; + i )( 2 ) A^w^ - (a: + 2 )( 3 , A^w^. +... 

which is the ordinary advancing formula for Mq in terms of and 
its differences. An alternative form for is 

2m* = m*_i2z>*- Am*_ 22X + AX-sSX-. 

(See Examples VII, No. i8.) If we make the same substitutions 
as above, we shall have 

«X - «o == ^(1) Am*_i - a:(2) AX-2 + (s) AX-s - • • • 
or Wo=M*-«r(i) Am*_i + :*( 2 , AX-2--«(3) AX-3+ •••! 

an interpolation formula involving backward differences. 
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To distinguish between the two expansions for we shall 

therefore refer to formula {a) as the “forward difference” summa¬ 
tion formula and formula {d) as the “backward difference” 
summation formula. 

’*‘8. These formulae can be adapted to the problem of calculation 
of when numerical values of u^. and are available. Mr 

G. J. Lidstone has shown that by making some simple substitu¬ 
tions the arithmetical work is reduced to a minimum. 

The proof below follows Lidstone’s proof ivlJ.I.A, vol. LXX. 

Let denote the indejinite finite integral of u^v^. Then can 
be expressed in either the forward or backward form as given 
above. If we wish to find the sum of n terms we must calculate 
and if in the formula we make either or vanish by 
so arranging the summations as to give zero values to the particular 
values of the sums S?;, we reduce the definite sum to 

a single expression, 5 ^ or — Further, from the zero values we 
can build up the whole table of sums by using the fundamental 
relation = 

Since in either the forward difference or backward difference 
summation formula we may choose our values so that or is 
zero, there will be four forms of tabulation. For the purposes of 
illustrating the process it will be sufficient to use one of these 
forms: they are all illustrated in Lidstone's paper, loc, cit. 

Consider the simple case in which the following values are 
given: 

X o 23 

Uy, 27 64 125 216 

I 3 7 S 

Here n is 4. By simple arithmetic 

= 27 + 192 + 875 + 1080 = 2174. 

(In this example has been taken as (;3c + 3)^ so that third differ¬ 
ences will be constant: the values of are quite arbitrary.) 


ri-2 
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The difference table is 


X 

Aua, 


A»«, 

0 

27 




37 



1 

64 

24 



61 


6 

2 

125 

30 



91 



3 

216 



We will use the backward form, making Sq==o. 


Thus, to use (the last term—in 

this case %) and its backward 

differences, 

we make o = 'Ez\^ = ^h}j,= 

... for x=^o; 

and in that case 


n-l 




0 

■ Am„_2S2w,„ + 


Construct the following table, where the sums 
downwards and stepped down one line at a time: 

of are taken 


^ Vx 




010 

0 

0 0 


I 3 I 

0 

0 0 


274 

I 

0 0 

(«-i) 

3 5 II 

5 

I 0 

(«) 

4 — 16 

16 

6 I 



n-l 


Then, substituting in the formula for S the required sum 


0 

IS 

4-216 X 16= 4-3456 

- 91 X 16= -1456 
4- 30 X 6=4- t8o 

— 6 X I = — 6 

4-2174 

which agrees with the result found by direct calculation. 

*10. The methods adopted in taking any of the other forms are 
similar, and the student will find it instructive to use these other 
forms in further examples of the same type. 
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It should be noted that, by combining two forms, we may t^ke 
the sums from some convenient central point and so reduce the 
figures involved in the summations. 

These formulae and methods are of great use in forming statis¬ 
tical “moments” of the form 


INTERPOLATION BY CROSS-MEANS 


*11. The divided difference formula is 

Ux = UaA\Ua + {x~a) (x-b) +.... 

b be 

If we are given only two values of namely ii^ and tiien 

«x = “a + (^-«) 

b 

= “a + (^ - «) («6 - «<.)/(“^ - «) 

b—x x—a 

= 7 -- Uf, 

b—a b—a 




a~x 
h — x 


■^h — a. 


This is a blend of and in the proportions {b — x)l(b — a) and 
{x~ a)j{b — a) and, from the determinant form, we may note that 
a first difference approximation to u^. is in fact a cross-product 
divided by the difference between the two given arguments. This 
is called a linear cross-mean. 


*12. Dr A. C. Aitken has evolved a method of interpolation by 
cross-means which enables simple arithmetical processes to be used 
without the need for either a formula or differences. 

The following is an elementary description of the method. 

If in the general divided difference formula we substitute 
successively ... for x giving one, two, three ... terms, we 

may write down the following table: 
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No. of 
terms 

One 

Two 

Three 

Four 


«6 =«« + (*-") A«a 

h 

Uc = M„ + (c - a) A«„ + (<^ - a) (c - b) A^Ma 

b be 

“d = “a + ¥- «) AWa + «) i'^-b) AX 

b be 

+ (^/-a) {d-b) (d-c) A®«a 

bed 


Parts 
a — x 

b — x 


c — x 


d — x 


where the column headed “parts” represents the factors to be 
used later. 

The first step in the process is to find first difference approxima¬ 
tions based on and and and ... as in paragraph 11. 
These may be denoted by {a, b), {a^ c), {a^ d) .... In order 

to obtain these approximations we use the parts in the last column 
above, and it is evident that the first two terms of the general 
formula (i.e. u^j^ + {x — a) Aw^;) appear in all the approximations, 

b 


Thus 


{a, d) = 


Ud 


a — x 
d—x 


U{d~a). 


The first difference results are therefore of the form 


+ {x-a) A 4 * (^ — a) 

b 

where represents the multipliers of the common factor (x — a) 
in (a, b)y (a, c), {a,d) 

We may construct a table similar to that above, the expressions 
in the middle columns giving the various values of k-y : 


{a, d) 
(^, e) 


o. AX 

be 


(c-b) AX 

be 


(d-b) A*«a + (d-b){d-c) AX 

be bed 

(e-b) AX + (^-*) (^-c) AX 

be bed 

+ (c-J) (e-c) (e-d) AX 
bode 


b — x 
C — x 


d — x 


e — x 
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The second difference approximations are found in the same 
way by the use of the parts as stated, and the results are of the 
form 

Ux = Wa + {x-a) A\Ua + {x-a){x-b) /^a} + {x-a) {x-b) k^, 

b be 


the first three terms being common to all the values, and the 
respective values of being those in the middle column below: 


u^(a,b,c) o./tha 

bed 

u^{a,h,d) AX 

bed 

{a, b, e) {e - c) A®m„ + {e-c){e- d) AX 

bed hede 


and so on. 


c — x 
d — x 
e — x 


* 13 . An example will make the method clear. In order to simplify 
the arithmetic equidistant intervals have been used, but the 
processes are, of course, perfectly general. 

Example l. 

Find u.j; when ^^ = *7352 from the following data: 

X ~z —1 o I 2 

15S49 16218 16596 16982 17378 

We require the following table: 


a 



h 


(^» b) 

c 


(a, c) (a, b, c) 

d 


{a, d) (a, b, d) (a, b, c, d) 

e 


Vx {a, e) (a, b, e) (a, b, c, e) 

where the parts 

are -27352, -17352. -7352, -2648, 1-2648, 



, b — x x — a 

Now 


Ux{a, 

where 


-1-7352, 



a-*= -2-7352, 



“0=15849. 



16218, 


b — a = i. 


and 
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/. i)= 15849X -i7352-(i62i8x -27352) 

= 16858-3. 

The calculation is performed very quickly by the aid of an arithmo- 
meter. 

Similarly, 

Ux { a , c) = H(i 5849 X -7352)-(16596x -2-7352)], 
since c — a — 2y 

= 16870*6 

and so on. 

Again, u^(a, b, = {a, h) + y_^Ux («. c), 

where —-• *7352, 

-1*7352, 

(a, Z») = 16858*3, 

(a, r)= 16870*6, 

and b — a-i\ 

II (6Z, h, c) — 16879*6. 

The completed table is as under. 




— 2 

15849 





— I 

16218 

16 

858-3 



0 

16596 


870*6 

00 

0 


I 

16982 


882*0 

8*9 9 

•I 

2 

17378 


^ 894*3 

9*1 i 

*4 

In practice, work is saved by omitting 

common figures in 

the columns. 


as in the calculations of the cross-means these figures will merely be 
repeated. 

The final result is 16879. 

The gradual closing-up of the approximation is clearly seen. It is 
best to arrange the w’s in the order in which they come into Gauss’s 
formula, namely, «q, .... In this way quicker convergence 

is secured. 

’•*' 14 . A further adaptation of this method is by the use of quadratic 
cross-means. Here the values of u^. are taken in the order 
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... and the divided difference formula becomes 

U^ = U_a + {x + a) + 

a a,~b 

+ (x^ - a^) (x + b) A®M_„ +.... 

a,-~b,b 

The first stage in the calculations is interpolation to first differ¬ 
ences by linear cross-means as before; at each subsequenrstage, 
by an arithmetically similar process, two further differences are 
allowed for simultaneously. 

Dr Aitken’s method of application is as follows: 

-a)^[(a + x) u^^-^(a-x) w_J/2a. 

Since this expression is unaltered by substituting —a for a, it 
is an even polynomial in say (a^). 

By letting a = xm {a, — a), we have ii^{x, — x), which is (x^). 

The problem is therefore reduced to interpolating for (x^), 
given v^(a^)y (^“)> •••• ^ repeated cross-means are 

therefore available, beginning with 






and similarly for (a^y c^) .... 

Again, 

-- 


{OTy h-y C2) = 


and so on. 

It is evident now that by interpolating on variables of the form 
we are moving by two orders of differences at a time, which 
improves the convergence to a marked degree. Further, with a 
central origin a, b, c ... are -5, 1*5, 2*5 ... and the factors 
(^ — x^y c^ — x^ ... all have the same decimal part. Also, 

the divisors h^~c^y c^ — c^y — ... are all even integers, which 

enables halved cross-multipliers and divisors to be used. Where the 
arguments are *5, 1*5, 2*5 ... these halved divisors become i, 3, 6, 
10 ..., the coefficients in the ordinary binomial expansion. 

Finally, for these arguments the first multiplier is | (J — x^) of 
—x)(|h-x), which is one-half the product of the multipliers 
used in the first linear cross-mean for v {\), 

An example will help to explain the method. 
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Example 2. 

Using the data in Example i, together with ^3 = 17773, interpolate 

to find m. 7352 - 

The extra value is required, as in the quadratic method we need an 
even number of terms. 

The working is as follows: 

The decimal part of the multiplier is 

H73 52 X-2648) =-0973405. 

The v*s are 

•7352x16982+ •2648x16596 = 16879-8 

1- 7352 X 17378 + 1-2648 X 16218 = i6888*9 

2- 7352 X 17773+2-2648 X 15849 = 16901-5 

The next steps are 

879-8 X I -0973405 888-9 X *0973405 = 878*9 

879-8 X 3-0973405-901-5 X -0973405 =879-1 

Further values of v are unnecessary, and the table, similar to that for 
linear cross-means interpolations, is 

16 879-8 -0973405 

888*9 87 I 8-9 1-0973405 

901*5 i 9*1 3*0973405 

Hence w.7332 = ^^879, as before. 

* 15 . The cross-means methods possess particular advantages. No 
interpolation formula and no diflPerences are required and we may 
stop the calculation at any point, so that any required degree of 
accuracy may be attained. For equidistant intervals the quadratic 
method is much to be preferred. When the intervals are unequal 
the linear process has considerable practical value; it is, in fact, one 
of the most direct and effective methods for inverse interpolation. 

Generally, the quadratic method is very advantageous for direct 
interpolation. It can also be readily adapted to inverse interpola¬ 
tion. When the arguments are equally spaced and the calculations 
can be effected by the aid of an arithmometer this device enables 
interpolated results to be obtained much more rapidly than by any 
other method. For instance, in Example 2, after the multiplier 
has been calculated, the successive values of v can be obtained on 
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an arithmometer without clearing the machine after each operation. 
After the method has been mastered simple interpolations can be 
obtained in a very few minutes. 

’*‘ 16 . This part of the chapter is based largely on the note by Mr 
G. J. Lidstone, “Aitken’s new processes for direct and inverse 
interpolation” in J.I.A, voL lxviii, pp. 272-86. In this note 
there is a full and lucid description of the methods of interpolation 
by cross-means with a number of fully-worked examples, and the 
student will find it instructive to compare the advantages and 
disadvantages of interpolation by difference formulae and the 
methods of cross-means, by working out actual numerical questions. 

COMRIE’S “THROW-BACK” DEVICE 

* 17 . In the same paper {loc. cit. pp. 286-93), Mr Lidstone gives 
a comprehensive account (with references) of the throw-back device 
of Dr L. J. Comrie, based on Bessel’s or Everett’s formula. This 
process is of great advantage when the interpolation is to be taken 
beyond third differences, as it shortens the work and reduces the 
number of differences to be recorded in the Tables. The nature of 
the process is indicated in the next paragraph; for a fuller account 
reference may be made to the paper mentioned above and to 
Comrie’s papers cited therein. 

* 18 . In Bessel’s formula (Chap. IV, paragraph 6), let the coefficients 
of the odd differences be called Bhi^ B^u and the coeffi¬ 

cients of the mean even differences B^^u, B^^Uy B^^u .... Then for 
the practical range of interpolation x^o to x—i, it is found that 

B^u 

change so slightly with x that they may, with sufficient accuracy, 
be assigned constant mean values provided that higher differences 
are not too large. 


the ratios 


and 


B^^^u 

B^u* 

B^^u 

B^^u^ 
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Adopting these values the formula takes the convenient form 

= + + [A 2 ~-i 84 A^-f-osSA® ...] Wq 

+ [AS--io8A® + -oi6A’ ...] «o, 

where Aw, A®m, A®w ... represent the odd central differences and 
A“w, Ah/, A®// ... the mean even central differences. 

The quantities in the square brackets are evidently independent 
of X and may therefore be tabulated. 

The throw-back device may be used with great advantage in 
double-entry tables (i.e. functions of x and y). Its use curtails 
very considerably the complicated interpolation formulae otherwise 
required when even moderately high orders of differences are 
involved. 

The device is also very useful for inverse interpolation, since it 
enables differences beyond the third to be included by a direct 
process without increasing the degree of the equation to be solved. 
Further, the last coefficient, is very small and changes slowly, 
so that allowance is easily made for the last term in the formula 
given above. 



CHAPTER IX 


APPROXIMATE INTEGRATION 

1 . In order to obtain the area of a curve by the methods of the 
integral calculus two conditions must necessarily hold. These 
conditions are 

(i) the equation of the curve must be known; and in that event 

(ii) the function == representing the equation of the curve 
must be integrable. 

In the theory of life contingencies these conditions are rarely 
satisfied. Rates of mortality, marriage, etc. are generally obtained 
from actual observations and the functions derived from these rates 
are seldom capable of expression in the form of a mathematical 
expansion. For example, if 4 the number of persons attaining 
exact age x in any year of time, the number living between ages x 
and x-hi is given by 



where x and t are independent. 

Unless 4 +/ follows some definite mathematical law, we cannot 
evaluate the integral by the methods hitherto employed. By making 
certain assumptions, however, we can obtain approximations to 
the value of the integral which are accurate enough for practical 
purposes. 

Thus, if we expand 4 +/ terms of 4 and differences of 4 ? we 
have 

4+<=(i+^y4 

= 4 + ^'^4 hrst differences. 

But ^4 “ ^x+l ^xy 

and 4‘"(r+i ^xy f^he number out of the 4 persons who die 
before reaching age x+i. 
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• • J 

1 

(l^ — td^) dt as far as first differences 

, 0 

~ ^a; \^x 

-4-H4-4-.1) 

= i (4+4+i)- 

This is an approximate value of on the assumption that 
is a first difference function of 4. 

It is evident that a simple approximation to first differences will 
not be justifiable except in a limited number of instances: the con¬ 
struction of formulae for approximate integration will necessarily 
depend on the data available and on the degree of accuracy required. 

2. Simpson’s rule. 

Suppose that we are given two values of only and that we 
represent these two values by means of the points H and K 
whose co-ordinates are (a, z/J and {b, respectively. Suppose 
also that we have sufficient information to justify our represent¬ 
ing the function y — u^ by a curve of small changes of slope, 
not affected by periodic changes. 

An infinite number of such mathe¬ 
matical curves can be drawn to pass 
through H 2J\d K. The simplest 
curve passing through the points 
is a straight line, and for the pur¬ 
pose of interpolation it is often 
sufficient to adopt a first difference 
formula, as was done for example 
in obtaining the approximation to 

in the preceding paragraph. If 
the straight line be drawn, the area 
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cut o£F by the curve, the *-axis and the ordinates x = a and x = b 
will be that of the trapezium HNMK. The area of the trapezium 

isi(“o+«6) (*-«)• 

Accurately, this area is J Uj.dx. 

A first approximation to the integral is therefore J (u 

If and the limits b and a are i and o respectively, 

we have the approximation to dt given above, namely 

J 0 

i (4 + 4+i)* 

Let us now improve the approximation by introducing a third 
known value of u^. In other words, suppose our data to be 
Then there is still an unlimited 
number of curves which can be 
drawn to pass through the three 
points ff, Ky L. The simplest of 
these curves will now no longer be 
a straight line, but a second differ¬ 
ence curve. I'he form of this curve 
will y —ABx -f Cx^y and since 

three points on the curve are given 
we are in possession of sufficient 
data to determine the coefficients 
Ay By C, 

In practice the data will generally be available at equidistant 
intervals, and since we may choose our origin where we please, 
the problem is therefore reduced to one of finding the second 
difference curve which passes through the points (o, w^), (i,Wi), 

(2, «2)- 

Having found the equation of the curve we can then find the 
area enclosed by the curve, the extreme ordinates and the jc-axis, 
by integrating the function thus obtained between the limits o 
and 2. 

Assume therefore that y — u^—a + bx-\‘Cx'^ is the required 
equation. 



Fig. 4. 
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Then, by substitution, 


whence 


Also 


Ui — (Z-h b Cf 

U2 = a-i'2b + 4 c, 


^ i ( - "2 n- 4«1 - 2Uo) = - ^^2 Wo, 

C—l {tl 2 — 2 Uj^-\-UQ) = lA^UQ, 


r 2 ^2 

u^dx== (a 
Jo Jo 


4 - bx -I- cx^‘^) dx 


== + i bx- 4- -3 cx:'^ 


= 2 w 4 - 2 ^ 4 --|c 
= i(z/o + 4 «i + W2)> 
on substituting for a, h and c. 

This is Simpson^s rule for approximate integration. 
We can obtain this formula by alternative methods. 
For example, let 

W 3 . = (I 4- Uq = Wo4-;cAwo -i-lx (x—i) A-Uq 


as far as second differences. 


f 2 r2 

Uj. dx = (wq + -^Awo 4 - ?> a: (a: — I) A^Wo} dx 
_ 0 Jo 

= (wo 4 - a: (AWo — iA^Wo) 4 - j 

Jo 

= j^^Wo 4- ix^ (Awo — o A^Wo) 4- ^:v‘*^A2woJ 


= 2 Wo 4- 2 (Awo - iA^Wo) + fA^Wo 
= 2 Wo 4- 2 A Wo 4- -JA^Wq, 


and since Wo = Wo, 

Wi==Wo 4 -Awo, 

W 2 Wo 4-2 Awo 4-A^Wo, 

this reduces to (z^q- f 4W1 4 -W2) as before. 



CHANGE OF UNIT 177 

A third method is to assume that the integral can be expressed 
in the form 

muQ-\-nu^-{-pU 2 - 

Then iiy = a-\-bx + cx^ we have eventually that 

r Uy,dx^2a + 2b-\-^c^ 

Jo 

as in the first method. 

Substituting in the assumed expression for the integral: 
ma~\-n {a-^b-\-c)-^p {a + 2b +^c) = 2a + 2b-\-%c. 

Whence, by equating coefficients of <2, b, c and solving the 
resulting equations, 

»2=i; « = »; p = \- 


3. Change of unit. 

The formula u^dx=^ -J- (w^ + + Wg) is an approximate formula 

J 0 

obtained by considering unit intervals. If we wish to transform 
the formula to a form in which the interval is changed to, say, n, 
then the given values of will be Uq\ Our new variable 

is z, where z = 7 ix, 

dz = ndx. 


The new limits are evidently o and 2n, 
The formula is 


f“i 

Jo n 

'2n 


i.e. 


/'2r 

Jo 


u^dz=ln (wo + 4Wn + «2j> 


or, on changing the variable to x, 



u^dx=ln{uo + 4u„+ti.„). 


This principle is of universal application and any formula of 
approximate integration can immediately be transferred from unit 
intervals to wthly intervals and vice versa. 


F M A S 


12 
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For example, the approximate formula 
no 

U^dx^\ (Ml + M4 + Mg -f- Mj>) 


/: 


r 20 

becomes u^dx — ^ {u^ + Mg 4 - Mj^ + «ig) 

Jo 


on doubling the interval; 


dx = (tl,, + M4^ + Mg^ + UgJ 

Jo 2 


and 

when the interval is n, 

4. Change of origin. 

Consider again Simpson’s rule: 




U^dx^l (Mo + 4M1 + Mo). 


We have obtained a formula which gives in terms of Mq, Mj and Mg 
the area of the curve cut off by the 
ordinates x^o and x = 2 . If we 
change the origin so that the point 
(i, Mj) becomes the point (o, as 
in Fig. 5, we shall have a formula 
of integration between the limits — i 
and I in terms of Vq and 
The formula will be otherwise un¬ 
altered and we shall have found an 
approximation to the same area with 

reference to a new j-axis Oj Yi. _ 

In its new form the formula be- ^ 
comes 




^-1 


Uq 




Fig. 5 - 




v^dx = J + 4t^o + ^i)» 


or, changing back to m’s, 
•1 




u^dx= I (u_i + 4 M 0 +«i). 
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Now when it is desired to obtain an approximate integration 
formula it is evident that integration between — i and i (or between 

n and n) involves much simpler algebra than integration between 
o and 2 (or between o and zn). Thus, if 

u^ — a + bx-^ cx^y 
then u^y — a, 

u_i — a — h + c, 

Uj = a-i~bi-Cy 

and I («-i+ 4 «o + Wi)= 3 { 4 «o+(»-i+«i)} 

= J (4a + 2a + 2c) 

= l(ba + 2c). 

Also J* (a + bx + CX-) dx = j^a^c + \bx'^ + h cx®J 

= 2a + |c, 

which proves the approximation. 

Again, since in this form the coefficients of odd powers of x 
disappear in the definite integral and also in the paired terms Uf 
and u_fy we could equally well employ as our assumed expansion 
for Uj. the third difference function a-{-hx + cx'^’{-dx^. 

We should have 

4^0 + (w_i + Wi) = 4a + <2 — i + r — + 6 + c + d = + 

as before, and 

\^x + \bx^ + + \dx^ —2a-\- fc. 

This leads to the important fact that Simpson’s rule is true to 
one more order of differences than was originally assumed, i,e. 
to third differences. 

Generally, formulae involving 2 r+ i terms, placed symmetrically 
with reference to the centre of the range, are correct to (2r4-i)th 
differences. 

For the above reasons, namely 
(i) for simplicity in working; 

12-2 



i8o 


FINITE DIFFERENCES 


(ii) to enable us to find the true order of differences to which 
the approximation is correct; 

it is advisable to integrate between — n and n in preference to any 
other limits. 

This can always be done by a suitable change of limits; for 
example, integration between | and i can be simplified to integra¬ 
tion between, say, 2 and 4 by increasing the interval from ^ to 2. 
Then, by changing the origin, a formula can be obtained for inte¬ 
gration between — i and i. To express the formula in the required 
form the process can then be reversed. 


Example l. 


f I 

Show that v^dx— -f 227^2 + ^3) 

Jrs 24 

f ^ ^ I f ^ 

v^dx^ u^.dx (if we double the interval). 

Jl-5 2j3 

This will produce a formula in u^\ Uq. 


In the first place we will obtain a formula for J u^dx. 

This formula will have to be of the form mw_2 + w«osince 
we have moved our origin to the point where x was originally 4. 


Let 

Then 

and 
so that 


— + cx^ + dx^. 

u_2 — a — zh + ^c — 8^/, 
u>2 — CL z»h -f- 4^ “f* 8^, 

• 2 ^: (*^—2 22t/(j + ^ 2 ) ~ "^4 {ZZCL + ZCL + 8r) = <7 4- 


■] 
J — 


ujx^ 


ax-\-\bx^-{- \cx^ + \dx^ 


= 2<^ 4 ic. 


J-i 


(m„2 + 22Wo + W2) = ; 
24 ^ 


I u^dx. 

Reverting to the first origin: 

I I r ^ 

- (1/2 422W44~ “ u^dx, 

24 2 j 3 

r 2-5 

{v^ 4 22??2 + ^’3) = 

Jl -5 


I 

24 
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Moreover, although only three values of have been given (so that 
only second differences are known), the formula is true to third 
differences, for it is symmetrical about the middle point. 


5. Some well-known approximate integration formulae. 

(i) Extension of Simpson's rule. 

We have f dx = (w, + ^u^ + u.), 

Jo 


Therefore by changing the origin 


Similarly 


/: 

/: 


U^dx--=l^ + + 

Uj. dx = I (W4 + 4w- + Uq). 


In general, by addition, 



{ Uq + 4Wj 4- 2^2 + 41/3 -I- 4- ... 4- 


This is a formula for approximate integration used extensively 
in practical mathematics. In engineering problems, where the 
curve to be integrated has actually been sketched as the result of 
experiments, a small unit may be chosen and a large number of 
ordinates drawn. In that event the extended formula can be ap¬ 
plied. This will in general give better results than the single formula 
r2n n 

uJx = -(Uf, + ^u„ + n2„). 

Jo 3 


It should be noted that Simpson's rule, applied in sections (as above), 
does not assume that a smooth curve can be drawn between all the 
points «Q, u^y ... Mgn. The method of obtaining the formula has in fact 
been to draw a number of disjoimted curves between Uq, 

••• ^2n-2y ^n~v ^2n> ^nd the curve passing through three points such as 
Wq, Wj, «2 will not as a rule pass through the next series W3 and 1/4. 

(2) The ''three-eighths" rule. 

The following symmetrical formula can be derived by working 
on the above lines, when four consecutive points are given. 
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If the four points were we should commonly in¬ 

tegrate between o and 3. Change the unit and origin so that we 
pass firstly to the limits o and 6 and secondly to the limits — 3 and 3. 
r 3 

We require Uj.dXy given w_3, W3. 

Let Uy, — a-\'bx + cx^ -f dx^, 

so that u_i -{-Ui = 2 (a-i-c) 

and w_3 + 1/3 = 2 {a + gc), 

" J’= //^dx — m (//_! + z/j) + // (//_3 + 7/3), 

we have 

+ \bx^ + \cx^ + \dx^ =m (u_^ + Wi) + n + z/3); 

i.e. 6a+iSc — m (z/_i + Wj) + w (w_3 + 2/3) 

= rn (za + zc) + 7 i (za + 1SV’) 

= a (zm + zn) + r (aw + 18;/), 
from which, easily, m=i8/8 and n = 6IH. 

J u^dx = ^{6(n_i + ui) + 2(ii_3 + 7 t.j)}, 
r” 

i.e. u^dx = i{6(u2 + Ui) + 2{t/„ + Ufi)], 

J 0 

or u^dx = % {3 (Ml + M2) + (m„ + M3)} 

= i («o + 3«i + 3«2 + “3) 

= |(i+£) 3 mo. 

This formula, like Simpson’s rule, is correct to third differences. 
(3) Weddle^s rule. 

Suppose that seven equidistant ordinates are given. In order 
to reduce the algebra to a minimum, we integrate between limits 
— 3 and +3, and write the assumed formula for thus: 

- dx{x^—i) ex^{x^—i) fx (x^—i) (x^~a) 

u^=:a + bx + — +—^-^ +-- Ll - 

^ 26 24 120 

(a:®-1) (a:2_4) 



Weddle's rule 


This is Stirling's formula with the constants a, b, Cy 
placing the differences of 
Then 


183 

... g re- 



bx^ 

cx^ d 

(x^ x'^X e (x^ 

ax-\ - 


— +- — 

2 

6 6’ 

U 2/ 24 \ 5 3/ 


120\6 4 2 / 72o\7 5 3 /J_3 

6. + 54 - +-^ ) + 486 f ] g 

\6 72 540/ ^ \l20 720/ 280 


= + 9c + 

Replacing a, c, <?, g by the differences of in Stirling's formula, 
we have 

J u^dx = 6 a + gc + ^e+~^^^^g 

= 6m(, + 9A“m_i +1 i5 A'm ,.2 + jVo A®?/_3. 

If now A^w_2 expressed in terms of 

t/_2, ... //2, we shall obtain a formula correct to seventh differ¬ 
ences. It will be found, however, that the coefficients of the terms 
are large. This is due to the awkward fraction multiplying 
AV_3. As sixth differences are usually small, the error involved 
in replacing general, 

be negligible. By this substitution the coefficients in the final 
formula will be much simplified. The modified formula will 
involve an error of Yi^A®w_3 and will therefore be strictly correct 
to fifth differences only but virtually correct to seventh differences. 

The terms in the expression thus adjusted, namely. 


6//0 4- 9A2w_i + ff A %_2 +A«w_3, 


may be expressed in terms of the w’s and collected as shown in 
the following table; 



W _3 

t/_2 

i/_i 

Wo 

Wi 

W2 

W3 

61/0 




60 







90 

—18'0 

90 



3 - 3 A‘«_j 


3*3 

“ 13*2 

19-8 

-13-2 

i 3-3 


• 3 A*U -3 

•3 

-1*8 

4'5 

— 60 

4*5 

- 1-8 

•3 

Total ... 

*3 

^‘5 

•3 

1-8 ' 

•3 

1-5 

•3 
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Therefore j u^dx is approximately equal to 

•3 m_3 + i-sm_2 + SU-i +1 •8i/o+-S^i + I‘S«2 + 0 ’ 3 W 3 

= Iff [("-3 + “s) + 5 («-2 + “ 2 ) + («-i + «i) + 6«/o] 
or u^dx = [(m„ + m«) + s (tti + Ms) + {u^ + M4) + 6m;J. 

This is Wedclle^s rule. 

An alternative method of obtaining Weddle’s rule is by combining 
a 7-ordinate Simpson’s formula with a 7-ordinate three-eighths formula 
in the ratio 9: —4. This method proceeds by eliminating the principal 
error terms in each of the two formulae involved. 

(4) Hardy's formulae. 

Certain approximate formulae due to G. F. Flardy are used 
extensively in actuarial work. 

Let u^=^a-\- bx -h cx^4 - dx ^-f ex^ 4- fxr* 


so that 


= 6^2 4 - i8r 4 - ; 


u^^ — a\ u_^ + u.^ — 2a + ^c + '^2e\ i62e. 

Solving for a, c and e, and substituting: 


J U^dx = 2 ' 2 Wo -f 1-62 (w_2 4 - M2) + *28 (m_3 4 - M3) 

or J M^i/x = 2*2M3 4 -1*62 (Mi 4-M5)4-*28 (Mo + Mg) 

= *28 (Mq 4 -Mg) 4 - 1*62 (M] 4 -M 5 ) 4 - 2 * 2 M 3 , 
which is Hardy’s “formula (37) 

If the interval of differencing be n, this becomes 


/*6ti 

J ^ U^dx = n {-28 (Mp + ««„) + I -62 (m„ + «5„) + 2-2M3 J. 


Similarly 


/•12n 

u^dx = n {-28 (Me„ + «i2„) +1-62 («7„ + «„„) + 2-2«9„}, 

J 6n 


and so on. 



hardy’s formulae 


185 

Since 

poo p6?i pi2n pl8n- 

u^dx—\ iij.dx-\-\ Uj^dx-{-\ u^dx-\-.,,y 

JO Jo J 6n J 12n 

/* QO 

U^dx = n{-28 (Mo+2M6„ + 2Ki2n+...) 

+ 1-62 (u„ + Us„ + U^„+...) + 2-2 (M3„ + «9„ 

This is Hardy’s “formula (38)”. 

If we are dealing with functions derived from a mortality table, 
and we choose n so that yn falls just within or just without the 
limits of the table—so that yn is in fact about 100—we can write 
the above formula thus: 


/: 


u^dx = n (-282/0+ I*622/„ + 2-22/3^+ I -622/5,+ -562/3^+ I-622/7, J. 


In this form the formula is known as Hardy’s “formula (39/2)”. 
(See G. King, Life Contingencies, p. 488.) 


6. (i) Since all these approximations give in effect the area of 
the curve bounded by n^, the limiting ordinates and the x-axis, 
they are often termed “quadrature formulae’’, the word “quadra¬ 
ture” being defined as the exact or approximate calculation of the 
area of the square equal in area to that of the given figure. 

(ii) In any of the foregoing formulae the sum of the coefficients 
must equal the range of integration, where this is finite. This may 
easily be seen by making all the 2/’s involved equal to the same 
constant. 


7. Practical applications of the formulae. 

Integration formulae of the above type may be used to advantage 
in obtaining approximations to the values of certain complicated 
forms of integral which occur in the theory of life contingencies. 
Where the functions involved are such that the summation extends 
to the end of the life table, it is customary to calculate the values 
of the functions by Hardy’s formula (39^). If, on the other hand, 
the upper limit is at an age-point short of the limiting age of the 
life table, any of the simpler integration formulae can be employed 
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to advantage. (See, for example, Spurgeon, Life Contingenciesy 
pp. 262, 263.) 

It is evident that we can neither integrate the function nor 
interpolate between given values of if we are absolutely without 
information regarding the value of the function between the 
ordinates. For example, in a mortality table we may integrate 
between, say, 4^ and 4i> assuming first differences constant, for we 
know that in general the decrements between ages 30 and 31 are 
small and may be fairly considered as being evenly distributed over 
the interval. We could not reasonably adopt the same assumption 
for interpolation between 4 ^tid 4 , since the deaths of infants in 
the first year of age are not evenly distributed over the year. Again, 
a jeliable estimate of the population of a seaside resort in June of 
any calendar year would not be obtained by a first difference 
interpolation between the population figures for January and 
December of the same year. In the two latter illustrations further 
information would be necessary before we could proceed to inter¬ 
polation or integration. 

In applying the formulae of approximate integration to the 
solution of a problem it is therefore essential that we have sufficient 
knowledge of the function to justify our assumptions regarding 
the nature of the curve passing through the given points. 

When the function to be integrated is one in which the neglected 
differential coefficients of the curve are small between the limits 
of the integration, almost any of the above formulae will give 
satisfactory results. If these differential coefficients are not small, 
i.e. those of lower order are changing rapidly between the limits, 
there may be considerable differences between the approximate 
results. 

Example 2. 

The formula for a continuous annuity-certain is 

^m\-\ e-^^dty 

Jo 

where 8 is the force of interest corresponding to the rate of interest 1 at 
which the annuity is valued. 
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By means of an approximate integration formula find the value of 
at 4 per cent. 

Since we have to evaluate we shall require some or all of the values 
of Ui from f = o to ^ = 6, where 

By a well-known formula in the theory of interest, 

where = (i + x)-^. 

Values of at 4 per cent, are available from any tables of interest. 
We have, therefore: 

"j=c® =1, 

= -96154, 

U2 = e-^ = v"= -92456, 
t/.j = = z)* = -88900, 

Ui = e~^ = v^= -85480, 

«5 = e“-^ = z)®= -82193, 

;/u = = -79031. 


The following results will be obtained: 

(i) Simpson’s rule applied three times: 

H“o + 4"i + 2 W 2 + 4«3 + 2«4 + 4"o + "c) 

= 5-3463o. 

(ii) The three-eighths rule (with interval of differencing 2): 

f {2 («u + 3 M 2 + 3 t <4 + «6)} 

= 5-34629 

(iii) Weddle’s rule: 

■3 (“0 + + “2 + 6M3 + K4 + 5 % + “e) 

= 5-34631- 


ro 

dt will be 


Jo 

L 8 J 


or 


7 ^ - 


-, and since the value of 8 at 4 per cent, as given by an interest 

u 

table is -039220713..., the integral becomes 


I- -7903145257. 
•039220713... 


- =5-34629.... 


The three approximate formulae all give results which differ very 
little from this. The excellence of the approximations is due to the fact 
that between the limits t^o and t — 6 the higher differential coefficients 
of are very small and the range is short. 
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8. The Euler-Maclaurin expansion. 

We have considered quadrature formulae in which the integral 
is expressed in terms of a number of particular ordinates. All 
formulae of this type can be deduced in a straightforward manner 
by means of Lagrange’s interpolation formula, as shown at the 
end of this chapter. We will now proceed to investigate the pro¬ 
blem of approximate integration more generally. 

The basic formula for expressing a definite integral in terms of 
given ordinates, with adjustments, is the Euler-Maclaurin expan¬ 
sion. This expansion is in effect of similar form to the expressions 
already obtained, a greater degree of accuracy being ensured by 
the addition of functions, not of the ordinates themselves, but of 
derivatives of certain of the ordinates. 

The formula may be derived by the expansion of operators, thus: 


^ f{x)=f{o)+f{i) + ...+f{n-i) 

x=-0 


= F {n) — F (o), where / (jc) is AF (x). 

Since /(.x:) = AF (x)y 
F(x) = A-^f(x) 

~{e^^— i)^^ f (;v) (since iH-A = ^^. Chap. VII, para. 6) 


I 


2 12 720 


•• /W 


=£»-VW-i/W+§/W-|-^/W... 

= (fix) dx-if{x) + — i- .... 

^ ^ 12 dx 720 dx^ 
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Between limits o and w, we have therefore 

F(; 7 )-F(o) = J 7 (*) dx-\ {/(„)-/(o)j + -L {/' («)-/' (o)} 

-jio (o)}.... 

For (w) — jp (o) we may write 

- /(*) or /(o)+/(i)+/(2) + -.+/(w-i). 

Jo ^ ‘^■^ =/(°) +/ (0 +/ ( 2 ) + •••+/(«- i) + i {/(«) -/(o)) 

- A {/' («)-/' (o)} + 7fo {/"' («)-/”' (O)} ... 

= i/(0)+/(l)+/(2)+...+/(«-l) + i/(«) 

- iV {/' («)-/' (o)) + rb {/"' («)-/"' (o)).... 
This is a simple form of the Euler-Maclaurin expansion. 

A more general form can be obtained by changing the origin 
to the point a and the unit of measurement to r, thus: 
j rainr 

r Ja f{x)dx=^y{a)+f{a + r)+f{a + 2r) + ... 

+f {a + n =Tr) + ^/ (a + nr) - ^ {/' (« + ”0 -/' (a)} 

+ ~--Af"’{a + nr)-r'ia)}..., 


It will be noted that, since 




we can express the coefficients in i)"^ in terms of Bernoiiilli’s 

numbers. As, however, the resulting approximation formula is to be 
used for numerical computation, it is of advantage to give the coefficients 
their actual numerical values. 


9 . The following examples are illustrative of the use of the 
formula. 


Example 3. 

f ^ {lx 

Evaluate —— to five places of decimals. 
Jo i+x 
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Choose a convenient unit, say 0‘i. Then in the Euler-Maclaurin 
expansion we have 

a = o, w = io, r = o*i, and = 
duj. _ I (Pu _ 6 

/7v (t -l. * //v3 ? T v\4* 


I I I 

- 1 _ ^ _ 

1*9 2 2 


a = o, 

w = io, r = 

0*1, and 

dUg. 

I 

(Pu 

dx 

- (i+»-)^’ 


dx I 

I 1 I 

I 

— — • . 

_j-[_ 

-b +. 

i+x 2 

I i*i 1*2 

1-3 


0*1 

r I 


— 

-^-b 


12 

L 2 ^ : 


12 {_ 2 ^^ 720 L 2^'^I^J 

13 I ^15 

= -coooo— —-.6. . 

120 4 720000 16 

*90909 

•83333 

•76923 

•71429 

•66667 

•62500 

•58824 

•55556 

•52632 

•25000 

6-93773 

= 6*93773 - *00625 •+■ *00001 
=6-93149. 

f ^ dx 

/. = *69315 to five places of decimals. 

J0 I 

This agrees with the true result (log^ 2) to the required degree of 
accuracy. 

Example 4. 

Find the sum of the fourth powers of the first n natural numbers by 
means of the Euler-Maclaurin formula. 

In the formula 


Ug.dx — i Uq ■4' “h ^a+ 2 T "!"••• 

“b \ ^a-^nr I” a+«r 


^ r t * ttf tt$ 

a+nr ■“ a+«r W a) "b (w a+-nr~"*^ a) •••> 
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n 

+ 2^ 4 -3^ + ... 

Jo 


+ (n-i)* + ~n*-~ [4x%_„ + ^^^[4.3. 
since higher differential coefficients of than the fourth will be zero. 


I.e. 


5J 


n r~ I T T 

= i: + J 24 «. 

0 r-l 2 12 720 


r~n 

r-1 


5 2 3 30 


By proceeding on the above lines it can easily be proved that the general 
formula for the sum of the />th powers of the first n natural numbers is 

H - — p (p — 1) (p— 2) 

r«i p+i 2 12^ 720^^^ ^ 

,/>(. />-i) (/>-2) (/ >--3)(/>~4) „p_5 


30,240 


10. Lubbock’s formula. 

The previous formulae have been developed for the purpose of 
relating a definite integral to the sum of a number of weighted 
ordinates at finite distances apart. We have, in effect, obtained 
approximate formulae for the value of 

Lt h (m„ 4 - + ^a+ 2 h + • • • + 

h-yO 

In addition to formulae of this type we can find expressions 
which enable us to find the value of the sum of a number of or¬ 
dinates at finite distances apart in terms of the ordinates at greater 
or less finite intervals apart. Thus, if for the curve y = there are 
m unit intervals, so that we have from ^ = 0 to x = m — i, we 
may develop a relationship between this sum and the sum when 
the intervals are, say, h, where nh=i. 

In place of the sum A + + +•••we may 

consider, without loss of generality, the simpler form when the 
series commences with Uq, 

Let there be tn ordinates at unit distances apart, and let each of 
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these unit distances be divided into n equal parts, so that the new 
ordinates are 

^r/m 

We have S = Uo + u^i„ + Mj,,, +... + u^_y„ 

E”'-i A S’"-! 


By ordinary algebra, putting h for ijtiy 
A A A 


(i + A)'^—! /tA +A(2)A2 +... 

I /2-I /2-—I /z2—I 

= ~ A+ - A^-A^.,., 

h 2 12 24 

so that the operator is 

n—i^ n^—i 


n + - 


A^l-- .A24-— .A 3 .... 


2 n i 2 n^ 2 ^rr 

To obtain S we apply this operator to 

- Wo = Wo + Wi + ... + w,,,_i . 

We note the first difference of this is 

(£'”'-i)mo = w„-Mo» 

the second difference is Aw„^ —Awq and so on. 
Thus, applying the operator, 

= Wo + Uii^^ + U2/n + ... + 


n — i 


W“ — I 


= « (Mo + «1 + . •. + «m-l) + -~Y~ (“»« ~ "0)-“ ^“0) 

This is Lubbock’s formula. 

The coefficients of higher differences than the second were not 
given by Lubbock and are cumbersome. The terms in A^u and 
A^w are respectively 

(n^— i) (19^3 — i) 


7207/3 


(AX-A-X) 
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and 


(yz^ i) (9n ^_^ ^ 

480w 3 “m ^ “O^ 


If the interval of differencing be originally n instead of unity, 
Lubbock’s formula becomes, on changing the unit, 




n—i 


= n («o + «» + W2« + • • • + + — — (“mn “ «o) 




*11. Lubbock’s original formula involves ... and these 

involve Wm+2» “•> values of m beyond the range of sum¬ 

mation ; these w’s may have to be specially calculated or may even 
be completely unavailable. De Morgan {Differential and Integral 
Calculus, pp. 317-18) and T. B. Sprague {J.LA. vol. xviii, pp. 
309-10) transformed the formula so as to involve only Wq ••• 

If the numerical values (without sign) of the coefficients of the 
differences in Lubbock’s formula be represented by Cj, Cg, the 
transformed formula is 

« (Mo + Ml + • • • + Um-l) + ” ^ i«m “ “o) “ Q (Am^_i - AKo) 

- Q (AX.-2 + - Q - AX) - C* (A^M„„4 + AX)- • • , 

where all C coefficients are negative and the terms involve alter¬ 
nately the difference between the terminal odd differences and the 
sum of the terminal even differences. The differences 

> • • • concluding differences of the scheme based on 

Wq ... u^, and lie on a diagonal sloping upwards (Sprague, /or. cit, 

p- 310)- 

12. Woolhouse’s formula. 

Although Lubbock’s formula has the advantage that it may be 
used when the function is not capable of expression as a mathe¬ 
matical expansion—as for example when the data are based on a 
mortality table—there are disadvantages in adopting the formula. 
In the first place, if it is necessary to proceed further than second 


F M AS 


13 
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differences the calculations are heavy, and secondly it may happen 
that the differences converge slowly, so that if we stop at second 
or third differences we are likely to obtain a result differing con¬ 
siderably from the true value of the function. In the case of a 
mathematical function whose differential coefficients are easily 
evaluated an alternative summation formula can be adopted in 
which differential coefficients of odd order replace the finite 
differences in Lubbock’s formula. 

The formula involving differential coefficients is due to Wool- 
house, and may be developed directly from the Euler-Maclaurin 
expansion. 

The Euler-Maclaurin expansion is 


rm j 

u^dx = |Mo + «1 + “2 + • • • + »,«-! + 2W,»- „ Ki - O 
Jo 12 

If the interval of differencing be i/w, the formula becomes 

rm j 

n u^dx = (iuo + + %„ + ... + i tij - (u„; - ?/,/) 

Jo 

+ -Wo'")-••• 

If, however, we multiply both sides of the first expression by n 
we have 


n u^dx = n{^Uo + Ui + U2+... + luJ-~(uJ-Uo') 

Jo 12 


rm 

Equating the two values of w Uj.dx: 

Jo 


+ -K'"-Wo'").... 


+ U^jn + • • • + “o ) + y20«® ^ " 

= n ( JMq + «! + «2 + • • • + 2Wm) “ “ “o') ^ “ ^o") • • • 
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or 


^0 + + %71 + ... + II- \ (uq H- u„^) - — ' - u^') 


+ - 


720W' 


(^m ^0 ) ••• 


= n {uq H- Wj + Wo +... + w„,) - ^ (wo + w,^) - - ~ (w^,/ “ z/(/) 


Re-arranging: 

^0 + ^ll7i + *^2/w. + ... -f — w (w^ + Wj + Wo 4-... + w,^) 


I '"X 

+ 72‘S(“"‘ “““ ^•- 


W — I 


(«0 + «m)- 


77 " — I w * — r 

..i- ^ (uJ-Uo')+ -,(w,/"-w;").... 


If the unit of measurement be changed to w, we have 
Wo 4 Wji 4 7/3 + • • • + ^ 7 ?w = ^ i^h + 4 W 2 „ 4 ... 4 w,,,^,) 

n~i , . rfi~i , , ,, ,,, ,,,, 

^ (^mn “ ^0 ) “^ 720~ ) ••'> 

the usual form of Woolhouse's formula. 

It should be noted that, by replacing the derivatives of w by 
tlieir values in finite differences, Lubbock's formula can be ob¬ 
tained directly from the formula above. 

In applying these formulae to certain actuarial functions the values of z/, 
du d^u 

dx* do^' ••• the end of the mortality table will disappear. Woolhouse’s 
formula may then be written as 


71 — 1 


n ^—1 


n^—i 


-{UQ+Ul + ,..)-~— tln+ -Wo'- •••• 

^ ' 271 ^ 1271* ^ J207l^ ^ 

This is a convenient form for expressing the value of a benefit paid 
at wthly intervals in terms of the values at intervals of a year. 


13-2 
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13* Other formulae for approximate integration. 

It will have been observed that in the formulae of the type of 
Simpson’s, Weddle’s, etc., the function of the w’s is symmetrical 
about the central value. If, however, a number of fixed ordinates 
be given and it is desired to obtain an approximation to the area 
of a curve in terms of these ordinates, the resulting form will 
not necessarily be symmetrical. Again, the formula for the area 
of the curve may be related not only to ordinates falling within 
the area to be measured, but to ordinates outside the area. It 
may be noted therefore that although standard formulae are avail¬ 
able, it is not difficult to devise approximations to fit the particular 
problems under investigation. (See paragraph 15.) 


Example 5. 

If is of the form a + bx-^cx^^ find a formula for u^dx in terms 

Jo 

of iiQy Uj and Wg* 

The interpolation formula which involves the terms Wq, and Wg is 

Uj^ = Uq + xAuq + (a: — i) 



xuq + \x^Au^ -f- ( - ix^) A^u^) 


1 

0 




= f 2 + 1^2 ^1 “ i2 ^2* 


The required formula is therefore 

f ( 5«0 + H~W2)* 

Jo 


It should be noted that (i) the expression is unsymmetrical in the «’s, 
and (ii) the ordinate Wg falls without the area to be integrated. 


Example 6. 

If Ug, is of the same form as in the example above, derive a formula 

for Ug^dx in terms of w, and Mg. 

Jo 

Let Ugg be expressed in the divided difference formula, thus: 

Ug, = M_1 + (X + l) A«_l + (^ + l) “ l) 

1 1,2 
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+ l - +ac] Aw_i+(- x\ A^«-i 


= +1 Am_i -1 A ^ m -1 ; 

1 1,2 


A^w_i = 

1,2 


W.> — z/, - z/_ 




On reduction, the integral is easily found to be 
3V(5“-i + 39«i-8«2). 

It is obviously impossible to quote all the formulae that are in 
current use. Further examples and illustrations of various approxi¬ 
mation integration formulae will be found in the following sources: 
Whittaker and Robinson’s Calculus of Observations, Chap, vii; 
C. H. Wickens, JJ.A. vol. liv, pp. 209-13 ; A. E. King, T.F.A, 
vol. IX, pp. 218-31; Elderton’s Frequency Curves and Correlation, 
3rd ed., pp. 26-8; J. Buchanan, vol. xxxvii, p. 384. 

14. Alternative methods of proof of the formulae. 

It has been stated above that the Euler-Maclaurin expansion can 
be used as the basic quadrature formula. It will be instructive 
to develop Simpson’s formula from this expansion. 


ujx=^n (il/o + + ... + + iWamn) - - , {u'zmn “ O 


approximately. 

Writing 2n for n, but preserving the same range o to 27 nn, 


e^mn 

Jo 


dx 2// (i Uq + U2J1 + U^j^ + ... + tl^ 2 m~ 2 )n "b 




Subtracting this from four times the first: 


U^ dx == 2« ( J Uq + 2 U,, + ttgn + + ... + W(2,«_2)n 


A F \^2mv) > 
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ujx = ; (Mo + 4 «» + 2 « 2 n + 4 « 3 » + ■ ■ • + 2 M< 2 m- 2 )« 

Jo 3 

4^(2m'-l)n 4" ^2mw)> 

which is the extended Simpson’s formula. 

15. Proofs by Lagrange’s formula. 

We have shown in paragraph z (p. 176) that we may adopt the 
expression for in terms of Uq and differences of Uq as the assumed 
form of function. Since Lagrange’s formula is based on the same 
assumption it is evident that, given selected ordinates, at either 
equal or unequal intervals, we can obtain any approximate formula 
by the use of the Lagrange formula. 

For example, given Wg, Wj, 

(x~l)(x — 2) x(x — 2) x(x~l) 


I U^^ — 3x1 + 2) 4 - Wi {zx — X^) + ^U2 {x^ — x)y 


so that 


J. h 16 - 4 -+*)+”■ r -1)+"■ (6 - 4 )J. 

“ 3 (^0 + 4W1 + which is Simpson’s rule. 

If we integrate between o and i we shall obtain 

u^dx=-^ (5W0 + 8«i - M2), 

the formula given in Example 5. 

By this method it is easy and straightforward to deduce special 
formulae for particular cases. 


*16. Remainder terms. 

It is known (Chap. Ill, paragraph 17) that if an interpolation 
formula is used to find from n values «„, ... the error 

involved, or remainder term, is of the form 

d'^ 

(x-a)(x-b) ...(x-k)^Ui, 
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where is an unknown number falling in the range which includes 
a, b k. By means of this result, expressions in similar form 
have been found for the errors or remainder terms of the principal 
quadrature formulae. 

For example, if the range of integration is a to 6, the remainder 
term in Simpson’s formula is 


-•00035 {b-af 


dx* 


and that of the three-eighths rule is 

— *00016 (b — aY tit. 
^ ^ dx^ ^ 


We cannot usually find but if the maximum value of the 
differential coefficient can be found the expression gives an upper 
limit for the error. 

For further information on this point the student is referred to 
Steffenscn, Interpolationy Sections 12-16, and Milne-Thomson, 
Calculus of Finite DifferenceSy Chap. vji. 


EXAMPLES 9 

I. Prove that 

r 2« ra 

u^dx=^\ + 

Jo Jo 


and illustrate the result geometrically. 

2. \i Ug^ ~ a+ bx + prove that 


u^dx = 2«2 + tV “ 2r/2 + «4)» 


and hence find an approximate value for 


10 


dx. 


3. Show that J u^dx = + Swq- t/.j) approximately. 

Find the approximate mileage travelled between 12.0 and 12.30 by 
use of the above formula, from the following: 
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Time 

Speed (m.p.h.) 

11.50 

24-2 

12.0 

35-0 

12.10 

41-3 

12.20 

42-8 

12.30 

39-2 


4. Prove that, if ti^ is a rational integral function of x, then 
f ^ 5 

Ug.dx = ae^ (i — aD -f +...) 


where D = - . 

ax 


5. Show that the area of a curve, divided into n parts by « +1 equi¬ 
distant ordinates Mq, Wj, ... u^t is given approximately by the series 


. in^ 

nuQ + — Auq +' 

^ 13 


W-] A-/y„ 
2) 1.2 


' — rr + n^ 


1.2.3 


to « + I tenns. 


6. Between the limits x = o and x~n the functions and dujdx 
are continuously increasing. 

rn n —1 

Show that u^dx is less than 'L \Uj^, 

Jo 1 

7. Obtain the approximate formula 

J~1 12 


showing up to what order of differences it holds. 

8. Assuming Uy. to be of the fourth degree in x^ express 
terms of and W4, 


f 5 

Uy.dx in 
Jo 


9. A plane area is bounded by a curve, the axis of x, and two ordinates. 
The area is divided into five figures by equidistant ordinates 2 inches 
apart, the lengths of the ordinates being 21*65, 21*04, 20*35, 19*61, 
18*75 17*80 inches respectively. Apply the method of integration 

to obtain an approximate value of the area. 


xo. Prove the approximate formula 
rlO 

a*Y* = 2-5 (Ui + Ut + Ug + Ug), 

Jo 


and show that the formula involves a small second difference error. 



EXAMPLES 


201 


f6 

II. Find the value of {i+xY^dx, 

Jo 

Obtain approximations to the value by applying 

(i) Weddle’s rule: 

[ u^dx = 0‘2 K + 5*'i + *^2 + 6w3 + W4 + 5W5 + We), 

Jo 

(ii) Simpson’s rule: 


\ uy.dx = \ (W(j + 4^1 + i/g), applied three times. 

J 0 

12. Which of the two following formulae would you expect to give 

the better approximation for u^dx} 

Jo 

i {5 (“o + «4) + 4 (“i + “s) + iSh,,}, 

(b) fV {2 (ua + u^ + ut) + iz (k, + Wa)}. 

13. Prove the approximate integration formula: 

n 

u^dx = n{^Ua + oV (1 ()u„ - S«2„ + « 3 «)}, 




j-l 

and hence find u^dx given the following table: 
0 


X o I 23 

27,650 31*252 35*^S4 39*368 

14. Prove that 

/W dx = i{fi-l)+f(h)} + ii {A/(-i)-A/(i)} approximately. 

Hence find f{x)dx, when /(o) = io5, /(i) = 2i2, /(2) = 42i, 

/ (3) = 749 and / (4) = 1050. 

15. If either increases continually or decreases continually as x 

rn n-1 

increases, prove that Uj^dx differs from S u^. by less than the differ- 
Ji 1 

ence between Ui and w„. 

Prove that the difference between log n and 
however great n may be. 
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16. If third differences are constant, prove that 

r2 

= oV («_j + 23“i + 23«J + «i)- 

Adapt this formula to find the approximate value of log^ 2 from the 

, {^^dx 
integral —. 

J a ^ 

17. Prove that, approximately, 

I tij^dx = 0*3 (i • Iz/_ 3 + 4-4m_2 + 1 + 4“o + + 4’4^^2 + ^*i%)- 


18. / (^) is a rational integral function of the fifth degree in x. 

Prove that 

I' /M dx=lfio) + ^{f{V^6)+fi-V^)). 

19. Use Simpson’s rule to prove that log^ 7 is approximately 1*95. 

20. Apply the Euler-Maclaurin formula to find a formula for the sum 
of the fifth powers of the first n natural numbers. 

21. Obtain Shovelton’s integration formula: 

rlO 

u^dx- j|-8 {8 (Wo + Mjo) + 35 («! + % + «7 + %) 

0 

+ 15 (tt2 + «4 + «6 + Wg) + 3 W- 

22. By means of Hardy’s formula 

[■G 

u,^dx~ •28W(> 4 -1-621/14- 2-21/3 4 -1 '621/5 4 - -281/6, 

Jo 

calculate the value of 

[^(x-x-^Y^dx 
J 0 


correct to four places of decimals. 

23. If/ (jc) be a function of the third degree in x, and if 

r-t rt r 3 ( 

= / (^) dx, = / {x) dx, Ml = / (:v) dx, 

J- 3 i J ~t Jt 

show that = 

24. AB is the base of a semicircle, centre O and radius unity. The 
points P and Q bisect OA and OB respectively. The area between the 
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semicircle, the base PQ and the ordinates at P and 0 ^ • Use 

Weddle’s rule that 

f V {X) dx = 0-3 {/ (o) + 5/(1) +/ (2) + 6/ (3) +/ (4) + 5/ (5) +/ (6)} 

Jo 

to find an approximate value of tt to three places of decimals. 

25. The following values of are given: 

^10123456 
Uj, *146 -161 -176 *190 -204 *217 *230 


Use an approximate integration formula to find the value of u.j.dx. 

J 0 

It is found that, for the values given, J'^^logio (•05^?+1-4) fits the 
data. Verify that this is so by integrating log^^ (•05X+1*4) between the 
limits o and 6. (logio e = '4343 i >ogio i -7 = ’ 2304 : login i -4 = -i 46 i-) 


26. If Uy, is a function whose fifth differences arc constant, J u,j.dx 
can be expressed in the form 

Find the values of q and a. 

Use this formula, after making the necessary changes in the origin 
and scale, to find the value of logg 2 to four places of decimals from the 
equation . 1 


- dx — loze^* 

Jol+X 


27. Prove that, if <3 = o, 


j + Ml + W2 + • • • + «r-i + lUr- iV 

"" ^ {^^^r-2 "fi ^^^0) y^O (^^^r-3 “ 

“ ri'o + A \).... 

If be the function (i +x^)~^ find an approximate value for tt. 


28. Obtain an approximate formula for Uj.dx in the form 

J -3 

<2(m_2 + M2) + ^(m_3 + M3) 
and find the values of a and b. 

29. Use Lagrange’s formula to show that 

[51783-04-308 (j-i4-3-1)-17 ( 3'-2 4 - 3 ' 2 )]. 



PROBABILITY AND ELEMENTARY 
STATISTICS 


CHAPTER X 

PROBABILITY 

1 . Suppose that a bag contains a hundred balls of exactly the 
same size and shape, of which one is coloured and the remaining 
ninety-nine white. If one of these balls be drawn at random it is 
safe to say that there is a greater probability of drawing a white 
ball than of drawing the coloured ball. Again, of a number of men 
in a community, all subject to the same conditions, the probability 
that one aged 20 will survive 10 years is obviously greater than the 
probability that one aged 90 will survive the same period. These 
examples serve to illustrate what is understood by the term ‘"prob¬ 
ability”. Many more such simple examples could be given, from 
which we could say that the probability that event A would happen 
is greater or less than that event B would happen. A difficulty 
arises, however, when we attempt to assign a measure to the 
probability that one of the events A or B might happen. 

In the first of the above examples it is not unreasonable to argue 
that since there is one coloured ball among a hundred balls, the 
probability that this coloured ball would be drawn is one in a 
hundred. Whether this be so or not, we have used the data at our 
command and have given a measure to the probability. In the 
second example, however, we have no data immediately available 
to enable us to assign a numerical value to the probability in 
question. It would be necessary to collect statistics of the mortality 
of the men in the community, and to supplement the collection 
of these statistics by more or less elaborate calculations. In the 
application of the theory of probability to most actuarial problems 
the aggregation of statistics is necessary before the data are avail¬ 
able, and it is not often that we are immediately in possession of 
such simple facts as are given in the first type of question. A com¬ 
plete answer to a problem in actuarial work would involve the 
collection and interpretation of the relevant statistics, and in many 
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of the questions on probability that are dealt with later it will be 
assumed that sufficient data are available to enable us to proceed 
to the required conclusions. 

2. In the study of probability we are concerned with the number 
of times that an event occurs on the average out of a very large 
number of occasions on which it is in question. The proportion of 
such favourable cases to the total number of cases may then be said 
to be the “probability of the happening of the event. For example, 
consider the following extract from a mortality table: 


Age 

Number of persons 

who are alive at exact 

X 

age X 

0 

1000 

10 

812 

20 

793 

30 

762 


This table may be interpreted thus. Out of looo births, 812 
children survive to age 10; of these 793 survive to age 20; of these 
762 survive to age 30; and so on. Since “living to age 10*’ is an 
event, and we start with 1000 births, this event is in question on 
1000 occasions; the proportion of times that this event happens is 
812/1000 or *812. 

*812 is therefore the “probability” that, according to the par¬ 
ticular mortality table given above, a child just born will survive to 
age 10. Similarly, the probability that the child will survive to 
age 20 is ‘793 and that it will survive to age 30 is *762. 

These probabilities are all averages, and being averages require 
adequate data for their calculation. In ordinary circumstances little 
can be argued from an event which happens on one occasion only; 
in probability nothing of value can be deduced from solitary in¬ 
stances. In fact, probability as applied to a solitary occasion rarely 
has any meaning; not only can few deductions be made from solitary 
instances, but in general they cannot be applied to future solitary 
occasions even if they have been drawn from sufficient data. 
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The collection and interpretation of the data necessary for the 
calculation of averages and probabilities is the 'work of the statis¬ 
tician, but it is the mathematician who is concerned with the laws 
followed by these averages and probabilities. It is worthy of note 
that although probability and statistics are in effect inseparable, the 
theory of probability was evolved from a very different source— 
namely, a consideration of gambling and games of chance. The 
association between the two sciences came at a much later date. 

3 . It will be observed that the term probability’* has been em¬ 
ployed in two senses. It has been used as the name of the subject 
itself, and also as the name of a ratio—the proportion of the number 
of favourable events to the total number of cases when the hap¬ 
pening of an event is in question. There is yet a third sense in which 
the word is frequently employed, namely in the sense of ‘‘credi¬ 
bility”. Such terms as “credibility” and “likelihood” in their 
ordinary colloquial meaning are incapable of exact mathematical 
measurement and with probability in this sense mathematicians are 
not concerned. 

In this and the following chapters the term “probability” will 
be used in the sense of an average, but in so doing it must be 
repeated that the study of the subject will be of no assistance in 
forecasting the results of solitary events. 

4 . When the cases which constitute the totality of possibilities 
for any event are counted, care must be taken that all the cases are 
strictly comparable. If we are considering the probability of living 
a number of years we must be careful to state whether we are 
dealing with male lives only or with female lives only or with an 
aggregation of both sexes; for the longevity of males is different 
from that of females. Similarly, if it be required to find the prob¬ 
ability that a coin of particular value is drawn from a bag containing 
a number of coins it must be stated at the outset whether or not all 
coins are equally likely to be drawn. If one of the coins were a 
half-crown and the others were all sixpences, it could hardly be 
said that it would be equally likely that any one particular coin 
would be drawn; the cases would not be homogeneous. 
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This important question of homogeneity arose quite early in the 
history of the subject. Errors were often made in the solution of 
what are now recognized as quite simple problems through the 
neglect of this principle. D’Alembert’s solution to the following 
problem is classic: 

What is the chance that heads will turn up at least once in two 
tossings of a coin? 

The solution given was as follows: 

“Only three different events are possible, namely, 

(i) heads the first time—which makes it unnecessary to 
toss again; 

(ii) tails the first time and heads the second; 

(iii) tails both times. 

There are thus two favourable cases out of three, and the 

required probability is therefore f.” 

The three cases of d’Alembert are not equally probable. The 
classification is wrong. This should be 



First toss 

Second toss 

(«) 

Head 

Head 

(6) 

Head 

Tail 

(c) 

Tail 

Head 

id) 

Tail 

Tail 


Of these four cases {a), (b) and (c) are favourable to the event in 
question, namely, “heads at least once”, and the required chance is 
therefore f. 

5 . At first sight it would seem that the examples above have been 
drawn from different sorts of experience. In paragraph 2 a result 
has been obtained from a consideration of a mortality table—which 
is a matter of records—while in D’Alembert’s problem we used an 
illustration based on the tossing of a coin—a matter of presumption 
and not of records. The difference between these two kinds of 
phenomena is real, but it does not render them incompatible in 
the study of probability, nor does it destroy the general idea of the 
average value. When we speak of tossing a coin we do not mean the 
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ordinary coin of commerce, but an ideal coin, which is a perfectly 
uniform circular disc with its centre of gravity at the centre of the 
coin. The same ideal conditions will be assumed to be present in all 
subsequent experiments with dice and cards. Thus a die will be a 
perfect cube and in a pack of cards each card will be assumed to be 
drawn as often as any other card in a large number of drawings. 
On these assumptions it is unnecessary to refer to records; we base 
our arguments on the definitions of the objects themselves. The 
illustrative examples that can be drawn from a consideration of the 
ideal objects are most suitable for demonstrations of the principles 
of probability; examples obtained from statistical records generally 
involve large and unwieldy figures. 


6. Definitions of the probability of an event. 

(a) The “unitary’’ or “a priori” definition. 

If an event can happen in a ways and fail to happen in b ways, 
all these ways being equally likely and such that not more than one 
of them can occur, then the probability of the event happening is 
cil(^ci “f b). 

If the probability that an event E will happen be denoted by py 
then/) == + A); this is called the “a priori” definition of prob¬ 

ability because we have reasoned from general considerations and 
not from statistical observations. Since, however, we have inserted 
the proviso that all the ways of happening are equally likely—or, 
to put it in another way, if the experiment were continued a very 
large number of times the different cases would occur with equal 
frequency—then we may say thatp — al{a-\-b) is the proportion or 
average of the number of times that the event E will happen. 

(b) The “statistical” definition. 

If on taking any very large number N out of a series of cases in 
which an event E is in question, E happens on pN occasions, the 
probability of the event E is said to ht p. 

This definition is of greater generality than the “a priori” 
definition; it gives the practical interpretation that the probability 
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of an event is a relative frequency. Further, it suggests that the 
probability arises from statistical investigation. 

The meaning of the statistical definition has already been ex¬ 
emplified by the probabilities of survival in paragraph 2. Further 
illustrations will be given later by reference to a correlation table. 

7. From the definitions above we may immediately deduce the 
following facts: 

(1) The probability of an event is a ratio. 

(2) The ratio can never exceed unity. 

The measure of certainty is unity. If an event is certain 
to happen the probability of its happening is i; if it is 
certain not to happen the probability is o. 

(3) The sum of the probabilities of the happening and of the 

failing of an event is unity. 

(4) If the probability that an event will happen is known, the 

probability of its failing is also known. 

Instead of using the fraction al{a-\-b) for the probability of an 
event, we may use an alternative form: iip = aj{a-{-b) and, there¬ 
fore, I —p = bl{a-\-b), then the odds in favour of the event are said 
to be a ; b. 

8. The simplest problems in probability are those in which both 
the number of favourable ways and the total number of ways in 
w^hich the event may happen can be counted, either arithmetically 
or by the aid of elementary rules. The enumeration is often best 
performed by the application of the theorems of permutations and 
combinations, and a thorough knowledge of this branch of algebra 
is essential for the speedy solution of many of these questions. 

Example 1 . 

An ordinary pack of cards contains 13 cards of each of four suits: 
spades, hearts, diamonds and clubs. Ten cards of each suit are numbered 
I (the ace), 2, 3 ... 10, the remaining three being “court cards’’, namely, 
knave, queen, king. 


F M A S 


14 
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Find the chance that if a card be drawn at random from an ordinary 
pack it will be a heart. 

Since there are 52 cards in a pack, the number of ways in which a 
card can be drawn (i.e. the total number of ways in which the event can 
happen) is 52. 

The number of ways favourable to the event is 13, since there are 
13 hearts in the pack. 

The required probability is therefore or 

Notes, (i) The above result means that over a long series of trials the 
proportion of favourable draws would approximate to one-fourth of the 
whole number of trials. It is convenient, however, to think in terms of a 
single trial, although the results of our calculations merely epitomize the 
experience of a long series of trials. 

(ii) The word “chance” may be taken to be the same as the word 
“probability”. There is a shade of difference between the two ideas, 
“ probability ” referring to past, present and future events, and “ chance ” 
to future events only. This distinction can be ignored. 


Example 2. 


Three cards are drawn at random from an ordinary pack. Find the 
chance that they are a king, a queen and a knave. 


The total number of different ways in which three cards can be drawn 
is and all these ways arc equally likely. 

The number of ways in wliich a king, a queen and a knave can be 
drawn together is 4®. 


4^, 3 ! 16 

The required chance is therefore - or --. 

52.51-50 5525 


Example 3. 

Out of 2W4-I tickets consecutively numbered, three are drawn at 
random. Find the chance that the numbers on them are in arithmetical 
progression. 

This problem can be solved by virtually writing down the sets of 
numbers that can be in a.p. Consider, for example, the sets of three 
numbers in A.P. beginning with 4: they will be 

4;5;6 4;6;8 4 ; 7 ; 10 ... 4 ; i ; 2w-2 4; « + 2;2?2 

where the highest number that can enter into any set is zn. There will 
evidently be « — 2 such sets of numbers (being the number of numbers 
from 5 to n 4-2, both numbers inclusive). We can therefore write down 
the following schedule; 
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Lowest number of the three 

1 

2 

3 

4 

5 

6 


Number of favourable ways 
n 

n—i 
n — I 

n — z 

71 —z 

«-3 
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271 — 2 I 

271— \ I 


The total number of favourable ways is 


2n(n+i) ] o j . 

—^- \ or and since 

2 


the total number of ways of drawing three tickets is 


(2Ai-r l) 271 (271— l) 


it is easily seen that the required probability is 

Example 4. 

If the letters of the word REGULATIONS be arranged at random, 
what is the chance that there will be exactly four letters betw^een the R 
and the E? 

The conditions are satisfied if 


the R is in the first place and the E in the sixth, 


or „ R 

>> 

second „ 

E 

,, seventh, 

R 

»> 

third „ 

E 

„ eighth, 

,, ,, R 


fourth ,, 

E 

,, ninth. 

„ R 

n 

fifth ,, 

E 

,, tenth, 

I. .1 R 


sixth ,, 

E 

,, eleventh, 


or if the R and the E are transposed in any of the above. 

Now there are 11 letters in the word; therefore the letters R and E 
can jointly occupy ii x lo positions. Since there are 6x2 favourable 
positions (as above) the chance that the R and the E are in the favourable 
position is or /g. 


Example 5. 

Find the probability that the number 8 will be thrown in a single 
throw with two dice. 

The total number of numbers that can be thrown with two dice is 
36, since each of the 6 ways of throwing the first die can be associated 
with each of the 6 ways of throwing the second. 


14*2 
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The number of ways favourable to the event can be found by simple 
enumeration, thus: 

First die Second die 

2 6 

3 5 

4 4 

5 3 

6 2 
Total: 5 ways. 

The required probability is therefore . 

9. Let us consider the simple question; if a die be thrown, what is 
the chance that an ace appears.^ 

There are 6 equally likely cases, one of which is favourable to the 
event. Therefore, by the unitary definition, the required chance 
is 

Again, by the statistical definition, we may say that since each 
face is as likely to appear as any other, in a very large number of 
throws (N) the ace will appear A^/6 times; the required chance is 
therefore as before. 

Whichever definition is employed it must be borne in mind that 
the probability J denotes the proportion of times that the ace would 
appear out of a very large number of throws, and that we are quite 
unable to predict on which occasions in a series of trials the ace 
will be presented. This leads us to examine more fully the meaning 
of the word ‘‘single” in such a question as “what is the probability 
of throwing an ace in a single throw with an ordinary die?” This 
point can best be explained by reference to an illustration similar 
to the first illustration in paragraph i. 

If there be five exactly similar balls in a bag, two coloured and 
three white, then by the unitary definition the probability that a 
coloured ball will be drawn (supposing one ball be drawn at 
random) is %. This does not mean that if a ball were drawn five 
times, being replaced each time, two of the draws would necessarily 
give coloured balls. The true interpretation of the fraction f is that 
if the experiment were repeated a large number of times a coloured 
ball would be drawn on the average twice in five times. 
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The meaning to be attached to the word “single” in the original 
question is that the records are to be per sets of one throw. 

In a similar manner, we may define the words “the next throw”. 
Suppose that it is required to find the chance that in two consecutive 
throws of a single die the numbers 2 and 3 will appear. The word 
“next” refers to a specific order, and what we set out to find is 
the proportion of times that the order 2, 3 appears in a very large 
number of trials. 


10. It has been remarked that illustrative examples in probability 
can most easily be given from a consideration of ideal objects, such 
as coins, dice, cards, etc. Examples from statistical records involve 
the collection and tabulation of statistics, and, moreover, often 
necessitate lengthy calculations. It is instructive, however, to con¬ 
sider the results that can be obtained from certain statistical records, 
and the following examples, based on what is known as a correlation 
table, are of interest. A correlation table may be regarded as a 
table giving the values of / (x, y), where x and y are two variables 
which it is known or suspected follow some kind of law of variation. 
In the table on p. 214 the Ji? variable is the age of husband, while the 
y variable is the age of his wife. The data are taken from the Census 
of 1921 and give to the nearest thousand the numbers of husbands 
in the various age-groups shown down the side with wives of the 
ages in the groups shown along the top. Thus, the figure 86 along 
the line (45-) and in column (50-) means that there w^ere, to the 
nearest thousand, 86,000 husbands aged between 45 and 50 at the 
time when the Census was taken who had wives aged between 50 
and 55. It should be noted that quinquennial age-groups have been 
taken in constructing the table, although more accurate results 
would be obtained by the use of individual ages. Apart from other 
considerations, it would, however, be impracticable to give for the 
present purpose a table at individual ages, and the results obtained 
below may be taken as sufficiently accurate. We may make the con¬ 
venient assumption that each husband has the age represented by 
the mid-interval value, and similarly for each wife. In other words. 



1921 Census 

Number of married couples for quinquennial age-groups of husband and wife (looo’s) 
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the number 86 may be read as indicating that there are 86,000 
couples of exact age 47J for the husband and 52J for the wife. 


Total 
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It should be noted that the values in the table are termed 
cieSy since they show how often that particular conjunction of ages 
occurred in the tabulation of the data. (See Chap. XI, paragraph 2.) 

We will now apply the figures from the table to answer some 
typical questions involving probabilities and averages. 

Note, The results will not be affected if the figures are read as they 
stand. In effect the values are all divided by 1000. 


Example 6. 

Find the probability that a husband aged 40-45 will have a wife 

(i) under 40 years of age, 

(ii) over 45 years of age, 

(iii) between 40 and 45 years of age. 


(i) We have to find p such that 

/)A^=number of husbands aged 40-45 having wives under age 40, 
= total number of husbands aged 40-45. 

44-22 + 84 + 291 401 . 


(ii) As above, 

= number of husbands aged 40-45 having wives over age 45, 
TV = total number of husbands aged 40-45. 


p=pNIN= 


99+ 15 + 3 + I _ 118 
964 ~9(H 


• 122 . 


(iii) 

= number of husbands aged 40-45 having wives aged 40-45, 
TV — total number of husbands aged 40-45. 


Note, The sum of the three probabilities obtained = ^ 

This is as it should be, for the three cases exhaust all those in which the 
husband is in the age-group 40-45. 


Example 7. 

Find the probability that a wife aged over 50 will have a husband aged 
less than 50. 
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If p is the required probability, then 

= number of wives over age 50 having husbands less than age 50, 
iV=total number of wives over age 50. 

AT/Tvr 14-4+184-105 128 

/. P - pN N = 7--7 - - -^-=-— 

676 + 497+ 343+215 + III +44+12+2 1900 


= *067. 

Example 8. 

In a sample of 10,000 husbands between ages 30 and 50, how many 
might be expected to have wives less than 40 years old? 

We require 10,000/), 

where A^ = number of husbands in the table between ages 30 and 50, 

number of husbands between 30 and 50 having wives less 
than 40 years old. 

/. lo^ooop—iOyOoopNjN 

I +64 + 390 + 864 + 9C3 

--^—+z-_z —^ X 10,000 

906 + 977 + 964 + 916 

_ 22,720,000 

3763 


=6038. 


11. Mutually exclusive events. 

If an eyent can happen in more than one way, all ways being 
mutually exclusive, the probability of its happening at all is the sum 
of the probabilities of its happening in the several ways. 

This proposition is sometimes known as the addition rule, and 
is fundamental in the application of the theory of probability. The 
rule may otherwise be stated thus: 

Ply p2y ••• Pn l^e probabilities of n mutually exclusive 
events, the probability that one of these events happens on any 
occasion in which the n events are in question is 

/>i+/>2-b/>3+ ••• 

The words “mutually exclusive’^ in this proposition are of the 
utmost importance. A proof of the rule depending on the statistical 
definition of probability will show that unless the events are 
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mutually exclusive we cannot add the simple probabilities. Thus, 
if there be a large number N out of a series in which the n events 
are in question, the first will happen on p^N occasions, the second 
on p^N occasions, the third on p^N occasions, and so on. Since the 
events are mutually exclusive, one and only one of the events can 
occur on any one occasion. Therefore, out of N occasions, one or 
other of the events will happen on p^NP2N-j-p^N + ... -{-p^N 
occasions. The probability that one of the events happens is there¬ 
fore (P1N+P2N+P2N+ ... -{-pnN)IN or /)i+/>2+/>3+ ••• 

It will be seen that this proof breaks down and the proposition 
is not true if any one event happens on any one of the occasions 
that any other happens, i.e. if the events are not mutually exclusive. 

It follows easily that if instead of keeping in view separate events 
we consider the various ways in which a single event may happen, 
the probability of the happening of the event is the sum of the 
probabilities of the mutually exclusive ways in which the event 
may happen. For, if there be n different ways in which the event 
may happen and p^ is the probability that the event happens in 
the rth out of the n different ways, then if the event must happen, 

r=n 

21 is the total number of occasions on which it happens in 

r-«l 

N trials. Since the event always happens, Ilp^N = N, so that 

I,pr=l. 

The whole number of ways in which an event may happen is 
sometimes called the universe of the event. If an event may fail, 
the failing of the event is also included in the universe. 

The following example illustrates the principle outlined above. 

Example 9. 

What is the chance of throwing a total of 3 or 5 or ii with two dice? 
The three events, i.e. throwing 3, throwing 5 and throwing ii, are 
mutually exclusive, since the occurrence of any one of them precludes 
the occurrence of any other. 

We require the proportion of times out of a large number of trials in 
which one or other of the three events will happen. 

Let there be a large number of trials N, Then since the chances of 
throwing 3, 5, ii are respectively, totals of 3, 5, ii will be 
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obtained on occasions respectively. The total number of 

times on which one or other of the events will happen is therefore 

Hence, by definition, the probability of throwing a total of 3 or 5 or 11 
is f, which is the sum of the separate probabilities of the three mutually 
exclusive events. 

In practice it is sufficient to apply the addition rule at once and to 
write the solution as 3^^ + + 3“^ = |. 


12. Compoimd probability: independent events. 

If there be a series of events such that the happening of any one 
of them in no way affects the happening of any other of them, the 
events are said to be independent. 

The probability that two independent events happen on any one 
occasion on which they are both in question is the product of the 
chances of their happening severally. 

For consider two independent events E and E\ the probabilities 
of which are p and /)' respectively. Out of a large number N in 
which the events are in question E will happen on pN occasions, 
and out of these pN occasions E' will happen on />' {pN) or pp' N 
occasions. The probability that they both happen is therefore pp\ 
Similarly, it may be shown that if there are n independent events 
the respective probabilities of which are pi, p2ip3y Pn^ f^e 
probability of the joint happening of them all is P1P2P3 ••• 

We may use the a priori or unitary definition of probability to 
prove this proposition. 

Let there be two independent events the probabilities of which 
are p and p' and let p = al{a + b)y p' — a'l{a' + b'), where a and b 
are respectively the numbers of favourable and unfavourable 
equally likely ways in which the first event can happen; and 
similarly for a' and b' with regard to the second event. 

Then there are (a -f- b) {a' + E) equally likely ways in which the 
two events are jointly in question. Of this number there are aa' 
equally likely ways favourable to the joint happenings of the two 
events. 
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Hence, by the unitary definition the probability of the joint 
event is 

{a + b) {a' + b')~ 

where P is the required probability of the joint event. 

In a similar manner we may prove the proposition for n inde¬ 
pendent events. 

This rule is known as the multiplication ruby and its application 
to the probabilities of happening or otherwise of two independent 
events may be seen from a consideration of the following schedule. 

Let the probabilities of happening of two independent events be 
p and p' respectively. Then 

(i) the chance that they both happen =/>/)'; 

(ii) the chance that the first happens and the second fails 

==p{i-p'y, 

(iii) the chance that the second happens and the first fails 

=p'{i~p); 

(iv) the chance that they both fail = (i -p) {i ~p')> 

The total of these chances 

-PP'+P (i ~P')+P' (i -/>) + (! ~P) (i -/>') 

= PP' +P-PP' +P' -PP' + i--p~-p'+pp' 

= L 

which is obviously true, since we have exhausted all the possibilities. 

13 . The application of the principles enunciated above can best 
be appreciated by reference to actual examples. 

Example 10. 

Find the chance that (i) three heads, (ii) two heads and one tail will 
turn up in three successive spins of a coin. 

(i) The chance of a head at the first spin = I 
,, ,, second ,, — ^ 

,, ,, third ,, 

Combined chance = 
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(ii) The chance of a tail is the same as that of a head, so that the 
total chance might appear to be as before. This is in fact the chance 
that 2 heads and i tail occur in a specified order, say, HTH. The con¬ 
ditions of the question would, however, be satisfied if the order were 
HHT or THII. These three favourable events are mutually exclusive. 
The chance that the two heads and tail are obtained in one of the three 
orders is found by adding the three probabilities. The required chance 
is therefore 3 x (|)'^ = 

Example li. 

Q is the probability that a man aged x will die in a year. Find the 
probability that out of 5 men, By C, Z), E, each aged Xy A will die in 
the year and be the first to die. 

The chance that a given man dies in the year = Q. 

The chance that a given man does not die in the year =1 ~Q, 

The chance that none of the five dies in the year = (i — Qy\ 

The chance that at least one man dies in the year = i - (i --Qy, 

Since the chance that A is the first to die is obviously and since 
this is independent of the chance that at least one man dies in the year, 
the required chance = J — Qf}, 

Example 12. 

A throws 3 coins and B throws 2 coins. Find the chance that A will 
throw a greater number of heads than B, 

(a) The chance that A throws 3 heads is (J)^. The chance in this 

case is x i, since B must obviously throw less than A, 

(b) The chance that A throws 2 heads and i tail is 3 (|), for he 

may throw HHT, IITH or THH. For B to throw less he 
must not throw 2 heads; the chance of this is i — 

The chance in this case is 3 (A)^ [i 

(c) The chance that A throws i head and 2 tails is 3 (|) (|)‘^. 

In order that B may throw less he must throw 2 tails, the 
chance of which is (J)^. 

The chance in this case is 3 (|)^. 

The three contingencies («), (b) and (c) exhaust all the cases for the 
required event and they are mutually exclusive. 

The result is therefore (i)® + 3 [i +3 (i)^ or, on simplifica¬ 

tion, 
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14. Compound probability: dependent events. 

If two or more events are not independent we may still apply the 
multiplication rule. Thus, if p be the chance of happening of an 
event, and p' the chance of happening of a second when the first 
has happened, then the chance that they both should happen ispp\ 

We may prove the proposition in the following manner. 

Let there be two events E and E' which are to happen in that 
order. Let the probability of E be p and the probability of E' after 
E has happened be p'. 

Then if the double event is in question on a very large number 
of occasions the number of times that E happens is pN —which 
may also be assumed to be large. Out of these pN occasions E' 
happens on p' (pN) occasions. Therefore out of the total occasions, 
Ny there are pp' N occasions on which the event E' follows the 
event E, It follows that the probability of the double event is pp'. 

By similar reasoning we may show that if there be n events 
El, E2y ... E,^ such that the probability of the (r-f i)th event after 
the previous specified r consecutive events have happened is pj.^iy 
then the probability of the happening of the n events is given by 
the product p^p^ ... pr •••pn- 

Care must be taken to distinguish between independent and 
dependent events. In considering independent events the hap¬ 
pening of either event does not affect the happening of the other, 
and to find the probability that they both happen we multiply the 
simple probabilities of each event. Where the events are not inde¬ 
pendent, we must find the chance/)2 that the second happens when 
the first has happened before we can apply the multiplication rule. 
For example, if there be two urns, one containing one white and 
two coloured balls, and the other one white and three coloured 
balls, the probability that the combined result of two drawings, 
one from each urn, will give the two white balls is J x j, since the 
drawing from the first urn in no way affects the drawing from the 
second. If, however, we were required to find the chance that in 
two successive drawings from an urn containing two white and 
five coloured balls the two white balls would be drawn (a ball 
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drawn not being replaced) we should reason thus. The chance that 
a white ball is drawn at the first drawing is f; after this has happened 
there will be left one white and five coloured balls, and the chance 
of drawing the other white ball at the second drawing will be 
The combined chance is therefore f x 

15 . It has been shown in paragraph 12 that the multiplication rule 
can be applied to the probability of happening of any number of 
independent events, so that if />, p\ p'\ ... be the chances of the 
events happening severally, the chance that they all happen is 
ppp'\.,. 

Again, if the chance that an event happens in a single trial be^, 
the chance that it will happen in each of r trials is^^, and the chance 
that it will fail in each of, say, {n — r) trials is (i -py-^. 

Suppose that it is desired to find the chance that an event will 
happen exactly r times in n trials. 

Let the chance that in any one trial the event does not happen 
be ?(=i~/>). Then if the event happen in r trials and fail in 
n — r, the combined chance is But of the n trials we can 

select the r trials in which the event happens in «(,.) ways. The 
chance that the event happens in exactly r trials out of the n is the 
chance that it happens in these r trials and fails in the remaining 
n — r trials. The required chance is therefore 

Now it will be noted that is the term containing p"^ in 

the expansion of {p-\-qy^> The successive terms of this expansion 
will therefore give the probabilities of the event happening 
72 , n—i, n —2, ... n~r, ... times exactly in n trials. 

16 . The following examples are illustrative of the principles out¬ 
lined above. 

Example 13. 

Find the chance of drawing a king, a queen and a knave in that order 
from an ordinary pack in three consecutive draws, the cards drawn not 
being replaced. 

This is an example of the application of the multiplication rule to 
dependent probabilities. 
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If />! be the chance of drawing a king; 
p2 „ „ » a queen, a king having been drawn; 

,, ,, „ a knave, a king and a queen having 

been drawn; 

then the chance of drawing a king, a queen and a knave in succession 

is PiPzPz^ 

Now there are 4 kings in the pack, so that the chance of drawing a 
king is When the king has been drawn there are 51 cards left of 
which 4 are queens. The chance of drawing a queen is therefore . 
Simi larly, the chance of drawing a knave from the remaining 50 cards is . 

P\ ~ b'l » p2 ~ J Pz " 5V • 

The required chance =/)i/)2/)3 = 

[Compare Example 2, where the three cards are drawn at random (not 
in a specified order) and the problem is treated as one in simple 
probability.] 

Example 14. 

A bag contains three balls, one red, one white and one blue. and Y 
draw a ball at random alternately. If X draws the red ball or Y the white 
ball it is retained. Otherwise the ball drawn is immediately replaced. 
Find the chance that just before the fifth draw is made the blue ball 
only is in the bag. 

Let R, Wy B denote the red, white and blue ball respectively. Now 
the blue ball is the only one in the bag just before the fifth draw if any 


of the following have happened : 

A Y 

first draw first dravv 

X 

second draw 

Y 

second draw 

(i) 

R 

B 

B 

W 

(ii) 

R 

B 

W 

W 

(iii) 

B 

W 

R 

B 

(iv) 

W 

W 

R 

B 

(v) 

R 

W 

B 

B 

(Vi) 

B 

R 

R 

W 

(vii) 

B 

B 

R 

W 

(viii) 

W 

R 

R 

w 

(ix) 

W 

B 

R 

IV 
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The chances of these events happening are : 

(i) i \ h 


Product 

-h 

(ii) 

1 

i 

1 

4 

^4 

(iii) 

i 

i 

1 

I 

tV 

(iv) 

X 

3 

i 

k 

I 

A 

(v) 

1 

3 

i 

I 

I 

1 

6 

(Vi) 


\ 

1 

1 

A 

(vii) 

i 

k 

i 

\ 

-h 

(viii) 


1 

3 

t 

X 

JL 

54 

(ix) 


i 

X 

3 

1 

J_ 

5 i 

The total chance 

is therefore 

2 _u 2 , 1 , 4 _ 

24" + T8 + '6 54 ~ 

47 

10 8 * 



Note, A systematic enumeration of possibilities is of great importance, 
and although in the above example the work may be shortened by 
alternative methods, for illustrative purposes the eventualities have been 
set out in full. 


Example 15. 

A bag contains three red and three green balls and a person draws out 
three at random. He then drops three blue balls into the bag and again 
draws out three at random. Show that he may just lay 8 to 3 with 
advantage to himself against the latter three balls all being of different 
colours. 


After the insertion of the three blue balls the bag may contain: 


Red Green Blue 

(«) 3-3 

{b) 2 I 3 

(f) I 2 3 

{d) - 3 3 


The probability that the bag contains {a) is l.l.s or jV. the 
chance that a green ball is drawn is obviously , the chance that a second 
green ball is drawn is f and the chance of a third green ball is then J. 
Similarly, the chances under (b), (r), {d) are respectively. 

Now three different colours on the second draw can be obtained only 
if the six balls come under headings {h) and (r). 

Under {b) the probability at the second draw of drawing three different 
coloured balls is 7,%, and therefore the compound probability that this 
will happen is • Under (r) the probability is the same. 
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The chance required is the sum of the chances under (h) and (c) 
and amounts to . Therefore, since the odds against the favourable 
happening are 73 to 27, the person drawing the balls may lay 8 to 3 
against his drawing three halls of different colours and obtain a slight 
advantage. 

17 . The methods for the solution of the above examples depend 
largely on the simple application of the formulae for permutations 
and combinations. It is safer as a general rule to use permutations 
rather than combinations. If repetitions are not allowed this is not 
very material, as each combination of r things forms r\ permuta¬ 
tions. If, however, repetitions are allowed then in most instances 
it is essential to use arrangements. 

Other algebraic devices are often useful. For example, in dealing 
with qucvstions involving the sum of the numbers that can be thrown 
with dice and kindred problems dealing with homogeneous pro¬ 
ducts, it is of advantage to employ the binomial theorem. 

The following examples illustrate the use of this method. 


Example I 6 . 

Four dice are thrown. Find the chance that the sum of the numbers 
appearing will be 18. 

Regard being had to the different ways of making up the same total, 
the number of numbers that can be thrown with four dice is the sum 
of the coefficients in the expansion 

(jc + -I- -f X^y, 

The sum of the coefficients will be found by putting x=i. The total 
number of possible numbers is therefore 6^. 

The number of ways of throwing 18 is the coefficient of in the 
above expansion. 

The coefficient of in {x + x^ + + x^x^x^y 

— ,, x^^ m x^ {1 + Xx^ oc^ + x^y 

= „ in (i+ + 

... (i— 

"" ( 73 ^ 

= „ x^* in (i (i-x)-* 


F M A S 


15 
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=The coefficient of in (i —+ ^ iox^+ ... 

+ 2ll^'_V+ ......^ 

() 6 
80 

The required chance is therefore 

Note, We are here dealing with arrangements which would be dis¬ 
tinguishable if one die were red, one blue, one yellow and one green. 
Selections in an example of this type would give quite wrong results. 

Example 17. 

Nine cards arc drawn at random from a set of cards. Each card is 
marked with one of the numbers i, o or — i, and it is equally likely that 
any of the three numbers will be drawn. Find the chance that the sum 
of the numbers on the cards thus drawn is zero. 

The number of favourable drawings will be the coefficient of in 
the expansion of + the total number of possible drawings 

will be the sum of all the coefficients in the same expansion. 

The coefficient of .r® in (x~^x^ + x^y 
= ,, xP in x~^ (i 

= „ x^ in {i + X + x^Y 

= ,, in (i — (i — 

Proceeding on the same lines as in Example 16, the required prob- 
7IIQ 

ability is found to be 

18 . The theory of probability was evolved from a consideration of 
games of chance, and many problems dealing with these games 
can be solved by the elementary methods outlined above. No new 
principles are involved; all that is required in attacking these 
problems is a clear understanding of the particular happenings 
that may arise. The following examples are illustrative of the 
methods: the first of these examples is analysed fully, and the 
student should be able to solve problems of the same type by the 
application of similar reasoning. 
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Example 18* 

A and B throw alternately with a pair of ordinary dice. A wins if he 
throws six before B throws seven, and B if he throws seven before 
A throws six. If A begins, show that his chance of winning is | 
(Huyghens’s Problem.) 

First, let us consider the chances of throwing six or seven with two dice. 
We can find the number of ways of throwing these numbers either by 
actually counting the number of ways (as in Example 5) or by finding the 
coefficients of and x'^ in the expansion of {xx^ ■¥ x/^ + x^x^Y 

(as in Example 16). Since the figures are small, the first way is simpler, 
and it is easily seen that six can be thrown in 5 ways, and seven in 6 ways. 
The chance of throwing six in one throw with the two dice is therefore 
/g , and of throwing seven ^. 

A can win if he throws six the first time. His chance of throwing this 
number is . 

He may fail to throw six the first time. He can then win if B fails to 
throw seven at his first throw and A throws six with his second throw. 

The chance that A fails to throw six is i ~ ^ = 3 6“* 

The chance that B throws seven is 3^ and the chance that he fails 
to throw this number is i — 3% = |^ • 

Therefore the chance that A wins at the second throw is 

If A fails to win at the second throw, he can win at the third throw if 
B has not thrown seven at his second throw. The chance of this is 


io 3^1 3^ ^ 

30••36-^6 *36 


and so on. 


. ]’s chance of winning in the long run, i.e. after a very large number 
of trials, is therefore 


3 0 ^1 3 0 5_ , 

3 6 • 3 6 • 3 6 * 3 6 • 3 0 


=^h{i+r + r^ + i^+ ...)> 


where r = f|.If. 

If we sum this geometrical progression to infinity, we find that ^'s 
chance 


5 ._L. = _ 5 _ 1296 ^30 

36 I —r 36* 366 6i * 


Example 19* 

A and B play a match to be decided as soon as either has won two 
games. The chance of either winning a game is ^ and of its being 
drawn ytr* What is the chance that the match is finished in 10 or less 
games.? 


15-2 



228 


PROBABILITY 


If the match is not finished in lo games, the following must occur: 

(i) All the games must be drawn; or 

(ii) A or B must win one game and the remaining 9 must be 

drawn; or 

(iii) A and B must each win one game and the remaining 8 must 

be drawn. 

The chances of these mutually exclusive events are: 



since this result may occur in 10 different ways and either A or B may 
win; 

since the number of orders in which this may occur is 

The chance that the match is not finished in 10 games is, by the 
addition rule. 


\IO/ \lO/ 20 \lo/ \20/ 


_9"-387 


i.e. the chance that the match is finished in 10 games or less 

9^.387 . 

= I _ £—= -ly approximately. 


Example 20* 

Ay By Cy D each throw two dice for a prize. The highest throw wins, 
but if equal highest throws are made by two or more players, those 
players continue. A throws 9, B throws 7. Find C”s chance of winning 
the prize. 

Consider the following scheme, showing the total number of possible 
ways in which C mfay win: 
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C may win if 

C throws 12, D less than 12 . 

C throws 12, D 12, and C wins later 

C throws 11, D less than 11 

C throws II, D II, and C wins later 

C throws 10, D less than 10 

C throws 10, D 10, and C wins later 

C throws 9, D less than 9 and C wins later 
(since A has thrown 9) 

C throws 9, D 9, and C wins later 

The total of these chances is 

19. Examples based on mortality tables. 

A mortality table is a table of the successive values of 4 for 
integral values of x, where 4 is the number of persons out of 4 born 
who attain precise age x. We are not here concerned with the con¬ 
struction of a mortality table from the original data. For the present 
purpose we may, however, note that 

(i) the table is constructed from data sufficient for us to use 

the N of the statistical definition of probability; 

(ii) the successive values of 4 are not absolute, but relative, 

referring to a common “ radix’’ chosen for convenience 
of tabulation. 

In explanation of this second point it may be stated that the 
radix may be any number and may be taken anywhere in the table. 

♦ The final factor in each of these terms expresses the chance that C will 
win after having equalled the number thrown by D and/or A, 


Probability 
1 35 

6*‘36 

1 I 1* 

6‘^ 6^ 2 

1 33 
62'36 

2 2 
^' 62-2 

1 30 
62-36 

3 3 I* 

62 "62 2 

4 26 I * 
6^ 36*2 

441* 

62 - 62-3 • 
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For example, the radix may be 1000 at age 50, so that — 1000; it 
may be 10,000 at age 20 or even 1,000,000 at age o. The purpose 
for which the table is constructed determines the position and 
magnitude of the radix. 

In the following table the radix is 1,000,000 at age o; for the 
purposes of illustration only that portion of the table from age 90 
to the end of the table has been taken. 

English Life Table No. 10 (1930-32). Males 


Age jc 

Number living 
at age x 

Ir 

Number dying between 
ages X and jc + i 

4 4+1 

90 

I 6090 

4600 

91 

I 1 490 

3478 

92 

8012 

2564 

93 

5448 

1841 

94 

3607 

1287 

95 

2320 

873 

96 

1447 

574 

97 

873 

3^''4 

98 

509 

223 

99 

286 

132 

100 

154 

74 

lOI 

80 

41 

102 

39 

21 

103 

18 

10 

104 1 

8 

8 


In the 4 column we have the number 16090 at age 90, while in 
the same column we have the number 873 at age 97. This does not 
mean that precisely 16090 persons aged 90 were observed and that 
seven years later there survived exactly 873 of these persons. The 
numbers of persons actually observed at ages 90 and 97 were quite 
different. We may, however, use the two numbers given at these 
ages to obtain the probability that a person aged 90 shall survive 
to age 97, thus: 

If p is the required probability, thtnpN— 873, where N= 16090. 
p is therefore j 5 >A^/iV’= 873/16090=*05426. 
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It should be noted that each value in the mortality table is the 
radix at its own age for all subsequent values in the table. For 
example, if 47/^0 gives the probability at age 90 of living 7 years at 
least, so also does /95//91 give the probability at age 91 of living 
4 years at least. Generally 4 +n /4 gives the probability at age x of 
living n years at least. 

The following example is based on the mortality table given 
above. 


Example 21. 

The ages of three men, Ay B and C, are 90, 91 and 92 respectively. 
Find the following probabilities: 

(i) that Ay B and C will be alive in two years’ time; 

(ii) that one at least of the three will be alive in two years’ time; 

(iii) that exactly one of the three will be alive in two years’ time; 

(iv) that exactly two of the three will be alive in two years’ time; 

(v) that all will be dead in two years’ time. 

The required probabilities are 

^90 4l 42 16090 11490 S012 

= I - (l -8012/16090) (l -5448/11490) (l -3607/8012)= *855. 

= *394 on substituting and simplifying. 


(iv) Y-r(^-r) + P 

^90 ^91 \ ^ 92 ' *90 

= *355- 


^ 92 \ 

^90^ 4l ^2 
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Note, If we add the results under (i), (iii), (iv) and (v) we obtain 
unity. These exhaust all the possibilities. The result under (ii), however, 
is not included, and it is instructive for the student to discover why 
this is so. 

The following are further examples involving the probabilities 
of living and dying, not based on the figures in the table on p. 230. 
'Fhe same principles are involved in the solutions of these questions. 


Example 22. 

Three men were known to be alive five years ago when their ages 
were 31, 48, 69. Assuming that of 98 males born together one dies 
annually until there are no survivors, find the chances that 


(1) all are alive now; 

(2) none are alive now; 

(3) one, and only one, is alive now; 

(4) two are alive and one dead. 


The chance that a man aged x dies in five years is clearly 
The required probabilities are 

( t \ 62 4A 24. 

67 • 50 * 29 » 

(-) 

(3 ) f I • /o * A + similar expressions; 

(4) -fo • A + two similar expressions. 


5 

98 - AT* 


As the above are the only possible events that can happen, the total 
of the chances is unity and the reader should verify this. 


Example 23. 

Three men, P, Q and P, are each of exact age 96. Find the chance 
that they will all die at different ages last birthday in the order P, Q, R 
given that 

Chance of dying before 
Exact age next exact age 

96 i 

97 f 

98 i 

99 I 

We have to find the chance of their dying in a given order. There will 
be no need therefore to take into consideration the different orders in 
which the men may die. 
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(i) P may die at age 96 Chance = 

Q may die at age 97, i.e. may survive 

a year and die in the following year „ =(i 

R may die at age 98 or later; i.e. may 
survive 2 years and die in the 

following year „ =(i - J) (i - J) i. 

The total chance under this heading is therefore 

(ii) P may die at age q6 Chance = L 

Q 9S 

R .. 99 .. =(i-l)(i-i)(i-l) I- 

Total chance under heading (ii) 

(iii) P may die at age 97 Chance = (i — i) f. 

o „ 9S 

R .. 99 .. =(i- -)(i-i)(i-4) I- 

Total chance under heading (iii) 

=(I -1) 1 X (i - 2 ) (i - ii) 4 X (l - i) (i - i) (I -I) I = gy. 

Required chance = 3^0 + 384 + ^ = • 

20 . A simple problem in probability may sometimes be capable 
of more than one reading. When giving the solution to a question 
which may be construed in more than one way it is essential that 
the assumptions on which the solution is based should be stated 
at the outset. 

The two examples given below illustrate this point. 

Example 24. 

What IS the chance that a hand of five cards contains a pair of two 
like cards of different suits 

If the problem is to find the chance that of five cards two are to be like 
cards and the other three unlike, a result is obtained which is quite 
different from the chance that there are to be two like cards, it being 
immaterial what the other cards are. Moreover, this second reading is 
capable of two alternatives: there may be two like cards and two other 
like cards (e.g. two kings and two fours) in the five cards; or there may 
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be three or more like cards—this is not expressly excluded by the 
question. 

It will be instructive before setting down all the chances that are 
possible to examine what would seem to be the obvious solution. 

Let N denote the number of ways of selecting five cards from 52. 


Then 

5! 

The probability of drawing exactly two like cards of given denomina¬ 
tion (aces, say) will be 

4-3 48-47-46 

2! 3 ! 2162 

- — - or — , 

N 54145 

and since there are 13 different cards in each suit it might be thought 
that to multiply the above fraction by 13 would give the result. This 
answer is, however, incorrect, in that there is “overlapping’*, for the 
remaining three cards out of the five may be a pair of like cards w^hich 
have already been counted. 

The possible arrangements, with their respective chances, are 


Arrangement 

(a) 4 like cards and i different 


{b) 3 like cards and 2 like cards 


{c) 3 like cards and 2 different 


(d) 2 sets of 2 like cards, i different 


Chance 


4-3-2 4.3 

13 X 12 X—;"* 

3 1 2 1 

4165 

6 

TV 

4165 

4.3.2 12.11 

I3X? 

88 

N 

4165 

^3-12 4.3 4.3 

2! 27 2 1 "^^ 

198 

N 

41'^ 


(e) 2 like cards and 3 different 


4*3 q 


12.II.10 


3! 


N 


or 


1760 

4165" 


The reasoning to obtain these chances is straightforward, and it will 
be necessary to examine one only: the others are on the same lines. 
Consider the arrangement in which there are to be exactly two like 


• 44 because 8 cards are unallowable, being 4 of each of the previous numbers 
chosen. This avoids duplication with (6). 
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cards and consequently three different ones. Having settled on the 
denomination of the like cards—which can be done in 13 ways—there 
remain 12 denominations from which to choose 3. This can be done 

in ways. The two like cards can be selected from the four 

(one of each suit) in ways. This does not quite complete the selec¬ 


tion, for the three different cards may be selected from any of the four 
suits, and this can be done in 4 x 4 x 4 ways. 

Returning now to the analysis above. If the problem is confined to 
the chance where exactly two like cards are to be chosen, the result will 
be (e ); if there are not to be more than two like cards, two sets of two 
like cards being permitted, the chance will be if there are 

no restrictions, and we are given that two like cards at least are to be 
among the five, we shall require the total of the chances (<2) to (e). 

These results are respectively 4i6§> li Si! • 

The above method of solution is designed to show, in full, the several 
probabilities of the various arrangements that may occur. If, however, 
it is known that there are no restrictions other than that there shall be 
at least two like cards, a simpler method of solving this problem obviously 
presents itself. It is sufficient to calculate the probability that there shall 
not be any like cards among the five; this probability is evidently the 
complement of that required. The probability that there are no like 
cards is 


13.12.II.10.9 


ii 

N 


X 4^ 


2112 

4165’ 


the complement of which agrees with the third answer given above. 


Example 25. 

A network of wires forms squares like a chess-board. Two spiders, 
starting at the same time from opposite ends of a diagonal, crawl to the 
end opposite them at the same speed and by one of the shortest routes. 
What is the chance that they meet.? 

Let the square AQBR be the boundary of the network. Since the 
spiders start from A and B at the same time and crawl at the same pace, 
after having crawled for the same length of time they must arrive at 
points on the diagonal OR, They will meet if the points on QR are 
identical, but not otherwise. Also, any point on the diagonal relevant 
to the problem must be an angular point of one of the squares of 
which AQBR is the boundary. The number of squares which have 
diagonals lying along QR is «. 
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Let P be such a point with coordinates (jc, y) reckoned from A as 
origin and AR and AQ as axes. Then x-\-y = n, and x and_y are integers. 



AX R 


The number of routes from ^4 to i? is , for any route consists of n 
steps in the direction A to R and n steps in the direction A to Q, These 
steps may be arranged in any of {zn )!/(« !) {n !) ways. As all these routes 
are equally likely, the chance that any one route is taken is i/""C„. 

But the number of ways of arriving at P will vary according to the 
position of P. The number of ways of arriving at P from A isn \/{x !) (y !) 
or (since x-i-y = n), and this is also the number of ways from P to P. 
Therefore, of the routes from A to P, pass through P, and 

the probability that a route passes through P is thus . 

The chance that the spiders meet at P is therefore ~ 2 np 

X" n 

the chance that they meet at all is . 

Now this result is based on the supposition that all routes from A to B 
are equally likely. It may, however, be argued that once the diagonal 
QR is reached, the event has or has not happened, so that it is un¬ 
necessary to consider routes beyond the diagonal. On this assumption 
all routes to the diagonal are supposed equally likely. 

Thus, to reach P from A^ the first spider may take one of different 
routes. To reach the diagonal, the number of possible routes, each of 

x^n 

which is, under the present assumption, equally likely, is S 

ic-O 

The probability that each spider reaches P is therefore and 

that they both do so is the square of this expression. 

Therefore the probability that they meet at all is 

x^n 

S ("CJ* 

£=0 _ 

2‘in * 

yyiQ =^Cq 4-”Cj+”C2 + — 2 ^ 


since 
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It should be noted that, while the numerator of the fraction denoting 

x^n 

the probability under the first assumption—namely D —cannot 

a: = 0 

be evaluated in the general case, the numerator of the corresponding 
fraction under the second assumption can easily be shown to be 

The two results are not the same. While a general proof is difficult, it 
is a straightforward matter to show that in a special case the numerical 
values are different. Thus, if « = 8, the probability in the first case is 
about *208 and that in the second about *196. 

21. The method of induction. 

There is a certain type of problem in probability for which it 
is not possible to obtain a solution by the direct methods outlined 
in the preceding paragraphs. In these problems it is necessary to 
find a relation connecting the chance at any stage with that at suc¬ 
ceeding stages and then to calculate the required probability by 
adopting an inductive process. The difficulty in these questions is to 
ascertain the fundamental relation: when this has been established 
the problem can be solved by the application of algebraic methods. 

Since we have to obtain a relation connecting the probability of 
an event of the nth stage with those at succeeding stages, it is often 
of advantage to investigate the relation for simple numerical values 
of n and then to deduce the general result. The following is an 
excellent example of this method of solution. 

Example 26. 

Five green balls and sixteen red balls are placed in a bag. A ball is 
drawn at random n times in succession and replaced after each drawing. 
Find the chance that no two successive drawings shall have given green 
balls. 

To satisfy the required conditions the events must have taken place 
as follows: 

At the first draw either a red or a green ball must be drawn: 

R 

G 

At the second draw one of the following draws must have taken place: 

R R R G 

G R 

i.e. first drawing associated with R; or R G. 
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At the third draw: 
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R R 
G R 
R G 


R 

R 

R 


R R G 
G R G 


i.e. second drawing associated with R\ or first drawing associated with 
R G, 


At the fourth draw: 


R R R 
G R R 
R G R 
R R G 
G R G 


R 

R 

R 

R 

R 


R R\R G 
G r\r G 
R g\r G 


i.e. third drawing associated with R\ or second drawing associated 
with R G, 


It is evident therefore that if we have attained success at the (n — i)th 
stage, i.e. if no two successive green balls have been drawn, we shall 
attain success at the wth stage provided that, either 

(i) the nth. draw gives a red ball; 

or (ii) if the nth draw gives a green ball the drawing at the (n~ i)th 
stage gave a red ball. 

The chance of drawing a red ball = I®, and the chance of drawing 
a green ball = ^T- 

If therefore be the required chance after n drawings, we shall have 

_ ]_6 .. 1 _6 16*/ 

"n “ 2 f “n-1 2 r * 2 1 -2 • 

This is a relation connecting three successive coefficients of the series 

Uq + U^X + U^x^ + ... -f- 2^^““ + + ... . 

The series is therefore a recurring series with scale of relation 

T — JL& V — I 6 y2 

^ 21 21 • 

Also Uq=i and = i, since the conditions are satisfied if no ball or 

one ball is drawn. 

Proceeding in the usual way, we find that the generating function is 


Uq-^x {u-i- ihujzi) 


or 


I + sx/zi 


(l +4^/21) (l —zoxjzi) (l +4JC/21) (i —zox/zi) 


^ 1_ J_ 1 

241-20:^/21 24T+4x:/2I 

24 \ 21/ 24 V 21/ 
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The required chance, is the coefficient of in these two expan¬ 
sions. This is easily seen to be 

A [25 (M)"-(-i)"(A)'‘]. 

Note. As an alternative method of obtaining the coefficients of (f f)” 
and (- we may say that a (|^)^ 4- ^ ^ ^nd h can then 

be found from the conditions Uq — i and u^=i. 

The following examples are illustrative of the same method of 
attack. 

Example 27 . 

A player tosses a coin and is to score one point for every head turned 
up and two for every tail. He is to play on until his score reaches or 
passes n. If is his chance of attaining exactly n, show that 

Pn = \{Pn-l+Pn-2) 
and hence find the value of . 

There are two ways of reaching n exactly, namely, by throwing 

(i) a tail when the score is « — 2; or 

(ii) a head when the score is « — i. 

The respective chances are \p^ 2 iPn~i • i^ince these are mutually 
exclusive, we have 

Pn ~ ^ {.Pn—1 "f" Pn-^' 

The value of/)„ may be found in either of two ways: 


(«) 

/>«= 

■ i (pn~l +Pn- 2 )y 


or 

Pn + h 

^n~l -Pn~l-^iPn- 2 - 

.(0 

Also 

Pi- 

i) 


and 

P 2 = 

^ obviously. 

4 

.(2) 

By repetition of (i) 

and use 

of the facts in (2) we find that 


Pn = I ~ i + i “ 

-i+ - 


L» 

which simplifies easily 

to 




I f, 

st 



(6) We may treat 

l-^X- 

■ as the scale of relation of a 

recurring 

series and proceed as in Rxample 26. 
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If the series is 

Pi = iypi = i andpo=i* 

The generating function of the series is 

I _ I 

i (i ~.v) (i-f ^a;)* 

By partial fractions this becomes 

2 I 

Expanding by the binomial theorem, we obtain for the value of />„ 
(the coefficient of x^) the same result as is found by the first method. 

Example 28. 

A has 10 counters and B has 5; their chances of winning a single 
game are in the ratio 2:1. The loser in each game is to give a counter 
to his opponent. The game stops when one or the other has lost all his 
counters. Find A's chance of winning all B's counters. 

Let Un be A's chance of winning all B^s counters when A has n 
counters. In the next game A must either win or lose a counter. His 
chances of these contingencies arc § and ^ respectively. When he has 
lost the next game his chance of winning all B's counters is 
when he has won the next game it is ,,1. 

Hence «n = f«n+i + ^«„-i- 

It is required to find from the above relation, given that Wjg = i 
(since A will then have won) and Wq = o (since he will have lost). 

The required relation for the recurring series is therefore 

l-x-hW. 

Since we may write this relation as 

we may put = « (1)^4-^, and then obtain a and 6 from the values of 
i/q and . 

This is a particular example of the problem of “duration’of play’^ 
The problem in its general form gives m counters to A and n counters 
to B, and states that A's chances of winning, drawing and losing a single 
game are/), q, r respectively, wherep + ^ + r = i. The method of solution 
is precisely similar to that above. 
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The inductive method can be adapted to other types of question in 
probability. An excellent example will be found in J.LA. vol. LVi, 
pp. 102-104, where a problem in direct probability is solved very simply 
by the inductive process. 

22. We will conclude this chapter with some miscellaneous ex¬ 
amples. These problems require only a careful application of the 
ordinary methods, and no special comment is necessary. 

Example 29. 

The sum of two positive quantities is constant and equal to zn. Find 
the chance that the product of the two quantities is not less than | their 
greatest product. 

The product of two positive quantities whose sum is constant is 
greatest when the quantities are equal. The greatest product is therefore 

2n zn^ 2 
2 ‘ 2 ~ 

If the two quantities are x and zn — x we must have 

X {zn — x)^—y 
4 

i .e. — ^nx -f 3n2 o, 

i.e. (zx — 3«) {zx — w) <o. 

Therefore x must lie between and -. 

2 2 

The possible values of x range from o to zn, and the chance required 
is therefore ^ 

zn 

Example 30. 

Ten clubs compete annually for a cup which is to become the absolute 
property of the club which wins it for three years in succession. Assuming 
that all the clubs are of equal skill, find the chance that last year’s 
winners, not having won the previous year, will ultimately win the cup. 

X 

Let — be ^’s chance of winning outright, A having won the previous 
year. Each of the others has a chance equal to 

X 

I- 

10 

. .. ■ 9 

9 

since the clubs are of equal skill. 


f3«. 

]2 


2 ) 


F M A S 


16 
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Now (a) A *3 chance of winning next year and the year after 

(^) A's chance of winning, losing and then having the same 
chance as the other eight losing clubs 

^ I 9 ^~io 

lo'io' g 

(c) A*s chance of losing and then having the same chance as 
the other eight losing clubs 

„ 9 10 

10' g 

X 

A*s total chance = («) 4 -(^) 4 -(^), and this we know to be Solving 

the resulting equation, we find that . 4 ’s chance is ^7. 

Note, The above solution assumes that the total probability of win¬ 
ning, for all the clubs, is unity, i.e. that the cup must eventually be won. 
It is easily seen that the cup must be won outright. If the chance that 
any particular club fails to win the cup outright after m trials is i/w, 
where n is greater than i, then if there be an infinitely large number of 
sets of trials, the chance that the cup is never won outright by any 

/l\km 

particular club will not be greater than Lt which is obviously 

zero. In other words, if the contests be continued for a sufficient length 
of time, the chance that the cup is not won outright is zero, i.e. the cup 
must be won. 


Example 31. 


The atoms of a certain radio-active clement are continuously dis¬ 
integrating at the uniform rate of per annum, those atoms that dis¬ 
integrate thereupon becoming atoms of a different element. What is the 
average period of years that an atom of the original element will survive 
as such, and what is the probability that any particular atom will survive 
that period.^ 

Let the year be divided into n instants. 


Then the chance that an atom disintegrates in any one instant is 


>> >> yy 


I 

300« 

1 

300/1 * 


and 


does not disintegrate 
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Now let / (r) be the chance that the atom has not disintegrated at the 
end of the rth instant. 

Then = 

and the average number of instants required for disintegration is 



= 300/7, i.e. 300 years. 


The chance that an atom survives this period is 
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1. Explain how the probability of a compound event, consisting of 
two constituent simple events, is obtained. Illustrate your answer by 
examples. 

2. The chance of one event happening is the square of the chance 
of a second event, but the odds against the first are the cube of the odds 
against the second. Find the chance of each. 

3. If three squares are chosen at random on a chess-board, show that • 
the chance that they should be in a diagonal line is 7^4. 

4. A man has three current English coins. Find the chance that he. 
can give change for half a crown. 

5. A can hit a target four times in 5 shots; B three times in 4 shots; 
C twice in 3 shots. They fire a volley; what is the probability that two 
shots at least hit? 

6. A and B stand in a ring with ten other persons. If the arrangement 
of the twelve persons is at random, find the chance that there are exactly ‘ 
three persons between A and B. 

7. The first twelve letters of the alphabet are written down at random. 
What is the probability that there are four letters between the A and 
the B? 
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8. Find the chance of drawing two white balls in the first two draws 
from a bag containing live red and seven white balls, balls drawn not 
being replaced. 

9. An experiment succeeds twice as often as it fails. Find the chance 
that in the next six trials there will be at least four successes. 

10. If an experiment is equally likely to succeed or fail, find the 
chance that it will succeed exactly n times in 2w trials. 

11. Find the chance of throwing ten with four dice. 

12. If a die whose faces are numbered from i to 6 is thrown four 
times, what is the probability that the sum of the four throws is 14? 

13. A five-figure number consisting of the digits o, i, 2, 3, 4 (no 
repetitions) is chosen at random. What is the chance that it is divisible 
by 4? 

14. Out of a bag containing thirteen balls, six are drawn and replaced, 
and then seven are drawn. Find the chance that at least three balls were 
common to the two drawings. 

15. If a die is thrown five times what is the probability that a six 
appears on at least two consecutive occasions? 

16. What is the chance that a person with two dice will throw aces 
exactly four times in six trials ? 

17. There are m candidates taking an examination paper of n ques¬ 
tions of equal difficulty; assuming that a candidate answers a question 
correctly or not at all, either being equally likely: 

{a) In how many different ways may a paper be answered ? 

{b) How many different sets of answered papers are possible? 

{c) What is the chance that a set of papers is handed in in which 
a particular question is solved by not more than one candi¬ 
date? 

18. Six cards are chosen at random from a pack of 52. Find the 
chance that three will be black and three red. 

19. A card is chosen at random from each of six packs of cards. Find 
the chance that three will be black and three red. 

20. The 26 letters of the alphabet are placed in a bag. A and B 
alternately draw a letter from the bag, the letters drawn not being 
replaced. The winner is the one who draws most vowels. A starts and 
draws a vowel with his first draw. What is his chance of winning? 

21. A book contains looo pages. A page is chosen at random. What 
is the chance that the sum of the digits of the number on the page is nine? 
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22. A bag contains three tickets marked with the numbers 00, 01, 
10, and two tickets each marked with the number ii. A ticket is drawn 
at random eight times, being replaced each time. Find the probability 
that the sum of the numbers on the tickets thus drawn is 33. 

23. If a; be one of the first hundred numbers chosen at random, find 
the probability that x -h is greater than 50. 

24. In a lottery there are 1000 tickets numbered i to 1000. Three 
tickets are drawn. Find the chance that 

(1) the three tickets bear consecutive numbers; 

(2) two of the three bear consecutive numbers, 

25. If m odd integers and n even integers be written down at random, 
prove that the chance that no two odd numbers are adjacent to one 
another is 

n ! (w+ i) 1 

(m -h n) I (n — /n -h 1) r 
m being not greater than n+ i. 

26. The sum of two whole numbers is 100; find the chance that their 
product is greater than 1000. 

27. There are ten tickets, five of which are numbered i, 2, 3, 4, 5 
and the other five are blank. What is the probability of drawing a total 
of 10 in three trials, one ticket being drawn and replaced at each trial? 

28. If two of the first hundred numbers are chosen at random, what 
is the probability that their difference is greater than 10? 

29. A and B have equal chances of winning a single game; A wants 
two games and B wants three games to win a match. Find the chance 
that A will win the match. 

30. A and B play at a game which cannot be drawn. On the average 
A wins three games out of five. Show that it is more than 2 to r that A 
would win at least three games out of the first five. 

31. Ay By C throw in order, each using three dice. Prove that A's 
chance of throwing 10 first is and find C's chance. 

32. A and B play for a prize. A is to throw a die first and is to win 
if he throws 6. If he fails, B is to throw and win if he throws 6 or 5. 
If he fails, A is to throw again and win if he throws 6 or 5 or 4, and so 
on. Find each player’s chance of winning. 

33. A and B play for a stake which is to be won by him who makes 
the highest score in four throws of a die. After two throws A has scored 
12 and B 9, What is A*s chance of winning? 




PROBABILITY 


246 

34. A and B play a set of games, to be won by the player who first 
wins four games, with the condition that if they each win three they are 
to play the best of three to decide the set. A^s chance of winning a 
single game is to B^s as 2 to i. Find their respective chances of winning 
the set. 

35. Aj B and C draw in succession from a bag containing four white 
and eight black balls until a white ball is drawn. What is the probability 
that the white ball is drawn by C? Is his chance improved if each ball is 
replaced after drawing? 

36. Three players of equal skill, A, B and C, play a series of games 
and the winner of each game scores one point. Each of the three keeps 
a separate score and the winner of the set is the one who first scores 4. 
A wins the first, B the second and A the third game. What is then C’s 
chance of winning the set? 

37. A and B play a match of five games, ^’s chances of winning, 
drawing and losing any game are in proportion to 3, 2 and i respectively. 
Two points are scored for a win and one for a draw. What is the chance 
that the match is drawn? 

38. A and B play a match, the winner being the one who first wins 
two games in succession, no games being drawn. Their respective chances 
of winning any particular game are p : q. Find 

(1) A’s initial chance of winning; 

(2) A*s chance of winning after having won the first game. 

39. Three players, A, B, C, play under the following conditions. In 
each turn the chance of success is the same for each of two contestants. 
A and B play together for the first turn, the winner plays with C, and 
if he win again he wins the game; if not, C plays with the third man 
and so on until one man has won two turns in succession. Find each 
man’s chance of winning the game. 

40. The winner of a game is the one who first scores 4 points, but 
if both players score 3 points the game continues until one player has 
scored 2 points more than the other. A and B play; find A^s chance 
of winning when the score is 2—o in B’s favour, being given that A 
is twice as skilful a player as B, 

41. Af B, C, D each throw two dice for a prize. The highest throw 
wins, but if equal highest throws occur (thrown by two or more players) 
the players with these throws continue, ^ throws 10; find his chance 
of winning. 

42. A and B cut a pack of cards, the player who wins the cut six 
times to be the winner. A^ having won four times to .B’s once, cuts a 
five. Find the chance that A will be a winner. 
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43. A and B play a match consisting of a maximum of nine games. 
The chance that any game is won by A, won by B or drawn are equal. 
A win counts one point and a draw half a point (to each). The match 
ends when one of the players has a sufficient lead to leave him with an 
excess of points over his opponent even if the latter were to win all the 
remaining games. What is the chance that the match ends with the 
7th game and not before.^ 

44. A bag contains ten counters, numbered i to 10, One counter is 
drawn and replaced and this operation is repeated until four different 
numbers have appeared. Calculate the probability that success will be 
attained with the sixth draw. 

45. If n is the product of any 69 integers taken at random, find, to 
the first significant place of decimals, the value of the probability that 
n is not a multiple of 5, given that logj^ 2 = -3010300. 

46. A bag contains counters marked with the digits 2, 4, 6, 8 and 
the number of times each digit occurs is equal to the value of the digit. 
Counters are drawn one at a time, each counter being replaced when 
drawn. What is the chance 

(1) that the digit 2 is drawn before the digit 8; 

(2) that the sum of the first three digits is 16; 

(3) that the first five counters drawn contain at least one of those 

marked 4 or 6.^ 

47. A and B have each eight pennies. Each tosses his set of pennies. 
Find to three places of decimals the chance that the number of heads 
obtained by A exceeds the number obtained by B by at least three. 

48. At a certain age 99 per cent, of the persons alive at the beginning 
of the year will live to the end of the year. Find expressions for the 
probabilities that out of four persons of that age there will die within 
the year 

(1) exactly 2; 

(2) not more than 2; 

(3) two specified persons; 

(4) two specified persons and no others. 

49. If the probability that exactly three lives out of six all aged x 
survive n years is -08192, find the probability that at least three survive 
n years. 

50. The probability that exactly one life out of three lives aged 20, 
35 and 50, will survive 15 years is -092; the probability that all will die 
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within 15 years is *006. If the probability that the life aged 20 will 
survive 1 5 years is -9, find the probability that 

(1) he will survive 30 years; 

(2) he will survive 45 years. 

51. Three men, Ay By C, are each aged 30. Given that the probability 
that a man aged 30 will survive 5 years is *974 and the probability that 
he will survive 10 years is *940, find the chance that between the end of 
the 5th year and the end of the loth year from now 

(1) one at least will die; 

(2) all will die, A dying first and B second. 

Find also the chance that A and B will die within this period and C will 
survive the loth year. 

52. The following table shows the probability that a woman of the 
age specified will marry in a year: 


Age 

Probability of marriage 

20 

•0665 

25 

•1033 

30 

•0649 

40 

•0183 


Find the probability that, out of 4 women aged 20, 25, 30, 40 re¬ 
spectively, only one marries within a year. 

53. Given that the probability that of three lives aged x one, and one 
only, will survive n years is 27 times the probability that all will die 
within n years, find the probability that 

(a) at least two will survive n years; 

(b) at least one will die within n years. 

54. Given the following table, find the probability that of four persons 
aged 65 at least one will die between ages 75 and 85 and at least one 
after age 85: 

Age Probability of surviving 10 years 

65 One-half 

75 One-fifth 

55. Urn things are distributed amongst a men and b women, show 
that the chance that the number of things received by the group of 
men is odd is equal to 

I {b + a)^-{b-a)'^ 

2 * Ib + a)^ ‘ 

56. The sum of two positive quantities is constant and equal to 2«. 
What is the chance that their product is less than 
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57. How many times must a man be allowed to toss a penny so that 
the odds may be 100 to i that he gets at least one head? 

58. A coin is tossed m-\-n times {m>n). Prove that the chance of 

fi~\- 2, 

at least in consecutive heads appearing is Find also the chance 

of a run of exactly m consecutive heads. 

59. If ten different things be distributed among three persons, show 
that the chance of a particular person having more than five of them 

,*e 

19 6 8 3* 

60. If /> he the chance that an odd number of aces turn up when n 

ordinai7 thrown, show that i — 2p — (\Y. 

61. A pack of cards has been dealt in the usual way to four players. 
One player has just one ace; prove that the chance that his partner has 
the other three aces is yo¥* 

62. A bag contains a certain number of balls some of which are white. 
I am to get a shilling for every ball so long as I continue to draw white 
only, the balls drawn not being replaced. An additional ball not being 
white is introduced and I claim as compensation to be allowed to replace 
every white ball that I draw. Is this fair? 

63. There are three sets of cards, red, yellow and blue. Each set 
contains ten cards, numbered i to 10. Three cards are drawn at random. 
Find the chance that the sum of the numbers on them equals 15: 

(1) if the cards are all to be drawn from the red set; 

(2) if one card is to be drawn from each set; 

(3) if the cards are to be drawn from the three sets mixed indis¬ 

criminately. 

64. From a bag containing ten red, ten white and ten blue balls one 
is to be drawn at random and replaced. The operation is to be repeated 
ten times. Find the chance that at least one ball of each colour will be 
drawn. 

65. Out of yi consecutive integers three are selected at random. 
Find the chance that their sum is divisible by 3. 

66. In a book of values of a certain function there is one error on 
the average in every m values. Find the number of times, r, a value 
must be turned up at random in order that you may have an even chance 
of turning up an erroneous value. Show that when m becomes large 
the ratio r\m tends to a fixed quantity and find this quantity. 
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67. A, By C and five other football teams enter for a competition. 
The teams are of equal skill and are drawn by lot in pairs before each 
round, the winners of the previous round entering the next round. 
Find the chance that in the course of the competition A will beat By 
having first beaten C. 

68. A street consists of 24 houses numbered i to 24, odd numbers 
on one side, even on the other. Three houses are vacant. Assuming that 
the houses are all equally likely to be vacant, find the chances; 

(1) that the three houses are next to each other; 

(2) that all three are on the same side of the street; 

(3) that if they are all on the same side the sum of their numbers 

equals 42. 

69. A put 10 balls in all, some of which were white, some red, 
and some black, into a bag, and asked B and C, neither of whom 
knew the number of balls of each colour, each to make 1000 draws 
and note the colour of each ball drawn. J^’s and C’s results were 
exactly the same, viz.: 

White 600 

Red 300 

Black 100 

It subsequently turned out that whereas B had returned each ball 
to the bag after noting its colour, C had kept the balls drawn in hand 
until he had drawn all ten balls and had then returned them to the 
bag and made another 10 draws, and so on. 

What difference docs this make to the probabilities of drawing balls 
of different colour as inferred from the results of the experiment, and 
what in the circumstances was the probability that B and C would 
obtain the same result? 

70. Find the chance that the sum of the numbers on three cards 
drawn at random from an ordinary pack of 52 cards amounts to 21, all 
the court cards counting as 10. How will the result be altered if an ace 
can count as i or 11 ? 

71. If a coin be tossed 15 times, what is the probability of getting 
heads exactly as many times in the first 10 throws as in the last 5 ? 

72. A bag contains eight counters numbered i to 8. Four are drawn 
at random. Find the chances that 

(1) the sum of the numbers on the four counters amounts to at 

least 17; 

(2) the counters numbered 2 and 3 are among the four; 

(3) the four counters contain at least two of the three numbers 

3» 5, 7- 
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73. If 6 n tickets numbered o, i, 2, ... 6w--1 are placed in a bag and 
3 are drawn out, find the chance that the sum of the numbers on them 
is equal to 6w. 

74. From a bag containing nine red and nine blue balls nine are drawn 
at random, the balls being replaced. Show that the probability that four 
balls of each colour will be included is a little less than 

75. A looks at a clock at some time between 2 and 5 p.m., all times 
within the limits being equally likely. He looks again when it strikes 
the next quarter hour. What is the chance that in the meantime the 
minute has overtaken the hour hand? 

76. Four suits of cards, each suit consisting of 13 cards numbered 
from I to 13, are dealt to four persons. Find the chance that each 
person’s cards contain all the numbers from i to 13. 

77. There are four sets of calculations on one sheet which have to 
be made, then checked and finally scrutinized. A and B can calculate 
only, C and D can calculate or check, and E and F can scrutinize only. 
No person may check a calculation he has made and all work must be 
signed. Find the chance that when the sheet is finally completed each 
name of the above six appears exactly twice. 

78. From an ordinary pack of cards a card is drawn and then six 
other cards at random. Find the chance that the card first drawn is the 
highest of its suit amongst all the cards drawn. 

79. A bag contains thirteen balls of which four are white and nine 
black. If a ball be drawn r times successively and replaced after each 
drawing, show that the chance that no two successive drawings shall 
have given white balls is 

i6.i2^-(~3)^ 

80. The reserved seats in a certain section of a concert-hall are 
numbered consecutively from i to 100. A man sends for five consecutive 
tickets for one concert and for eight consecutive tickets for another. Find 
the chance that there will be no number common to both sets of tickets. 

81. In a cup draw there were four Southern and four Northern 
teams, the names being drawn one by one from a bag. Find the fol¬ 
lowing probabilities: 

(1) that each one of the four successive pairs consisted of one 

Southern and one Northern team; 

(2) that each one of the four successive pairs consisted of two 

Southern or two Northern teams; 

(3) that the first drawn in at least three of the pairs was a Southern 

team. 
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82. A number consists of seven digits whose sum is 59. Find the 
chance that it is divisible by ii. 

83. Two dice are thrown and one of the players will win (a) if the 
sum be 7 or ii, or (b) if the sum be 4, 5, 6, 8, 9 or 10 and the same 
sum reappears before 7. Find the player’s chance of success. 

84. A and B cast alternately with two dice. It is agreed that, on each 
failure to win, the prize money is to be reduced by 3 per cent, of its 
value at the previous attempt. A wins if he throws 6 before B throws 7, 
and B wins if he throws 7 before A throws 6. A starts first. Compare 
the values of the respective chances of A and B, 

85. A bag contains n counters marked i, 2, 3 ... n. If two counters 
are drawn show that the chance that the difference of the counters 
exceeds m (less than n — i) is 

{n~m) {n — m —1) 
n{n-~i) 

Deduce from this result the chance that if two points are taken at 
random on a line the length between them exceeds half the length of 
the line. 

86. A bag contains two white balls and one black ball. A drawing 
of two balls is made. If either is black, the two are replaced and another 
black ball is added. A second drawing of two balls is then made, and 
again if either is black, the two are replaced and another black ball is 
added and so on. 

What is the chance that if the drawings are continued indefinitely 
two white balls will never be drawn together.^ 

87. ^’s chance of scoring any point is of B'%. A engages to score 
14 in excess of B before B shall have scored 3 in excess of A. Show that 
A*s chance of winning the match is equal to 

I-(If 

I-(!)"' 

88. A and his wife engage in a “mixed doubles” tennis tournament 
in which each pair of players consists of one member of each sex. There 
are fourteen other persons, seven of each sex, also entered for the 
tournament and players are drawn by lot before each round in such a 
way that any person of one sex may be the partner of any person of the 
other sex. Only the winners in any one round enter the next round. 
Assuming that all the players of each sex are equal in skill, find the 
probability that in the final round A and his wife play together as 
partners. 
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89. All that is known about a quadratic equation is that the coefficients 
are all different and are positive integers less than 10. Find the chance 
that the roots of the equation are real, all the integral values of the 
coefficients satisfying the above conditions being equally likely, and zero 
values of the coefficients being excluded. 

90. A pack of cards is dealt in the usual way to four players of whom 
two and two are partners and the dealer turns up his last card. Denoting 
by the term “honours” the ace, king, queen, knave of the suit to which 
the turned-up card belongs, find the chance that each pair of partners 
shall have two honours. 

91. A number taken at random is squared. Find the chances that the 
following are even numbers: 

(1) the digit in the units place of the result; 

(2) the digit in the tens place of the result; 

(3) the digit in the hundreds place of the result. 

92. Before commencing a game of cards four players cut for partners, 
i.e. the two highest play together and the two lowest together. All suits 
being of equal value, what is the chance that they will have to cut again ? 

93. A and B play a series of games to be won by the player who first 
wins two consecutive games. chances of winning, losing or drawing 
any particular game are J and J respectively. Find B's chance of 
winning the match (a) at the outset, (b) when he has just won one game, 
and (r) when he has just lost one game. 

» 94, A and B have equal chances of winning a single game. A wants 
n games and J 5 games to win a match. Show that the odds on 

A are i -f/> to i -p where /> = —— 

^ ^ nlnl 2^^^ 

95. Three posts are filled one after another by lot from amongst ten 
persons (A, jB, C, Z), and six others), the first by any one of the ten, the 
second by anyone except A and By the third by anyone except C and D. 
What is the chance that A or B or both of them, and C or D or both of 
them, are chosen? No one can hold more than one post, and in drawing 
for the second and third posts the barred persons and any person 
previously chosen are excluded. 

96. With a hand of thirteen cards a player is known to hold one ace. 
What is the chance that he has at least one other ace ? 

If it is known that the ace he holds is the ace of hearts what is the 
chance that he has at least one other ace? 
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ELEMENTARY STATISTICS 

1. It is not the object of this book to deal exhaustively with the 
science of statistics. We shall treat only with the elementary ideas 
of the subject, and this treatment should be sufficient to enable the 
student to solve the simpler types of problem that are met with in 
practical work. 

Statistical investigation covers a very wide field and is concerned 
with the collection, tabulation and summarization of data, the in¬ 
terpretation of the results and the deductions that can be made 
therefrom. Collection and tabulation must here be taken for 
granted and the other aspects will be considered to a limited degree 
only. 

2. Elementary definitions. 

Suppose that a number of men were selected at random and 
arranged in order of their heights, and suppose that these men 
formed a representative and unbiased sample of the particular class 
under review. We should be able by mere inspection to obtain some 
idea of the variable characteristic—in this case, height—of the class. 
For some purposes this might be sufficient, but, in general, this 
method, namely, the assembling of the men, is quite impracticable. 
Further, we should probably need records of the experiment, and 
it is essential therefore to arrange the work so that these records 
may be readily made. 

If we were to represent the men by straight lines proportional 
to the heights of the men, drawn at equal intervals at right angles to 
a fixed straight line, and retained the order of magnitude as before, 
we should be in possession of a permanent record of the heights. 
An arrangement such as this is termed an array and the individual 
values of the measured characteristic are called variates. As an 
illustration of this pictorial review of a particular characteristic we 
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may consider the following figure, which shows the heights of 
42 men, all of whom are under 5 ft. 9 in. For convenience of 
illustration only the excess of height over 5 ft. is given, and 
fractions of an inch are ignored. 



Fig. 6. 

An examination of this array shows that there were 2 men whose 
height was 5 ft. 3 in., 2 whose height was 5 ft. 4 in., and so on. 
These numbers, which from the diagram are 2, 2, 2, 5, 12 and 19, 
are known as the frequencies with which the heights occurred. 

The full sample of which the above is an extract consisted of 100 
men for whom the table of frequencies is as under: 


Height of man to the 

Number of 

nearest inch j 

cases 

S ft. 3 in. 

n 

5 4 

2 

5 5 

2 

5 6 

5 

5 7 

12 

5 « 

19 

5 9 

16 

5 10 

16 

5 II 

14 

6 0 

7 

6 I 

2 

6 2 

3 


100 
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These data can be arranged to advantage in an array, not of 
variates, but of frequencies, in either of the following forms: 



Fig. 7. 



Fig, 8. The block diagram in this figure is called a histogram. 

It should be noted that whereas Fig. 6 represents an array of 
heights with one scale only, namely height. Figs. 7 and 8 necessitate 
the adoption of two scales, height and frequency. In these figures 
we have, in fact, shown values of a variable (frequency) compared 
with those of the independent variable (height). 

3 . Reference to the table on p. 255 will show that the figures in the 
first column are given to the nearest inch. It is obvious, however, 
that, for example, the 19 men whose heights were'stated to be 
5 ft. 8 in. were not all exactly of this height. “Heights of 5 ft. 8 in. 
to the nearest inch” means any height between the limits 5 ft. 7I in., 
to 5 ft. 8i in., and this interval of one inch is called the class- 
interval The recorded height of 5 ft, 8 in. may be taken as the 
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mid-value of the class range. Data may be displayed either in 
the form given in the table on p. 255, or with the class range 
indicated by its lower limit, as shown in the following example 
relating to the age distribution of a body of lives: 


Age 

Frequency 

20*5- 

38 

21*5- 

70 

22*5- 

82 

23-5- 

lOI 

24*5- 

128 

25-5- 

175 

26-5- 

181 

27-5- 

200 

28 - 5 - 

181 

29-5- 

I go 


1346 


Here the class-interval is one year, and it is often assumed for 
practical purposes that there were 38 cases of age 21, 70 of age 22, 
and so on. In many distributions it will be found convenient to 
use a larger class-interval than a unit interval; examples of various 
class-intervals will occur later in this chapter. 

A set of relative frequencies tabulated according to class ranges 
is called a frequency distribution. Given an ungrouped set of ob¬ 
servations it is a matter for careful consideration as to the best form 
of grouping to adopt in order to obtain a reasonable frequency 
distribution from which to work. 

The following is an important point which may arise in the con¬ 
struction of a frequency distribution. Suppose that in examining 
the data for a distribution such as that in the above table it was 
found that there were two persons exactly 21^ years old at the date 
when the observations were made. In such a case it is the practice 
to assign one of these to the group zol-zil and the other to the 
group zi^zz^y since they may be said to belong equally to each 
of the groups that they divide. If there had been three of such 
cases, 11 would be assigned to the lower age-group and i^ to the 
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Upper group. It may happen therefore that although we are dealing 
with integral numbers of cases the distribution may show fractional 
frequencies when the table is completed. 

4 . Averages. 

It will be convenient at this stage to revert to a pictorial repre- 
wsentation of an array of variates as exemplified in paragraph 2. In 
the following example a small sample only has been taken in order 
that the deductions that may be drawn from the figure may be 
clearly shown. 

The data relate to the ages of a class of 25 students and are given 
in the following table: 


Nearest age 

Number of 
students 

18 

I 

19 

2 

20 

1 

21 

6 

22 

3 

23 

i 4 

24 

I 

25 

I 

26 

I 

27 

I 

28 

4 


_ 


The array of variates is 



Fig. 9. 
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There are 25 variates depicted in this diagram in order of magni¬ 
tude, and since there happens to be an odd number, we can at once 
single out the middle variate. The middle variate corresponds to an 
age 22 and is a particular variate of our sample. The statistical name 
for the middle variate is the median^ so that the median, or to be 
more precise the median age, for this distribution is 22 years. If 
therefore a number of such experiments were made with different 
samples of similar students we should be in a position to compare 
one average characteristic of each sample, namely the median age. 

If we examine the diagram and table again, we see that the 
variates which occur with the greatest frequency are those repre¬ 
senting the number of students at age 21. We have found the mode^ 
or most frequent value, and this again is another average. In this 
case it happens that the mode is immediately apparent from the 
table giving the frequency distribution, although the median can¬ 
not be deduced without further investigation. 

Now an average may be defined as a quantity which typifies the 
magnitude of the set of variates from which it has been ascertained, 
and it may occur to the reader that the simplest average that could 
be found from the given data would be the ordinary arithmetic 
mean or, if we were using a frequency table, the weighted meani 
This is so, and by simple calculation it is evident that the mean is 
22*8 years. 

We thus have three averages: 

Mean ... ... 22-8 years 

Median . 22 years 

Mode ... 21 years 

and it remains for us to determine which of these averages is the 
most satisfactory for statistical purposes and, in addition, what use 
we can make of them. 

It should be noted that frequency distributions may be illustrated 
by curves drawn through the peaks of the ordinates of a frequency 
diagram such as that in Fig. 7. While the positions of the mean, 
mode and median are often represented on frequency curves by 
ordinates, it is really not the ordinates but the corresponding 
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abscissae which give the values of the mean, etc. Thus, in the fre¬ 
quency curve in Fig. lo below, the values of the three averages 
are XA, XB and XC and not the ordinates through B and C. 



5 . It is not immediately apparent which particular kind of average 
is most suitable for any group of observations, and it requires a con¬ 
siderable amount of experience before a student can use these 
averages to their best advantage. There are, however, criteria which 
enable us to select the most satisfactory form of average for general 
adoption. The principal criteria may be stated as follows: 

A satisfactory average should be 

(а) rigidly defined; 

(б) based on unbiased observations as numerous as possible; 

(c) relatively stable with fluctuations of sampling; i.e., though 

different samples might produce slightly different re¬ 
sults, the variation from sample to sample should be as 
small as possible; 

(d) capable of easy calculation and simple algebraic treatment. 

Let us consider the example in paragraph 4. If it had happened 

that our data included no cases over age 27—i.e. the four cases at age 
28 were non-existent—the mean age of the 21 cases observed would 
be about one year younger than before—about 21*8 years—while the 
median and the mode would remain unaltered. It appears from 
this simple example that a few extreme cases more or less may 
considerably affect the mean, while the median and mode are un¬ 
changed. But this possible disadvantage of the mean disappears 
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when larger distributions are under consideration. In point of 
fact, the mean possesses several advantages over the median and 
the mode. It is rigidly defined, while the median is somewhat 
indefinite. Again, the mean can easily be calculated and lends itself 
to simple algebraical treatment, while, as will be seen later, it is 
generally impossible to obtain more than estimates of the other two 
averages. For these reasons the mean is the average that is used to 
a very great extent in statistical calculations. 

6. Calculation of the mean. 

Let n be the number of observations, x^. the value of the variate 
in the rth class-interval and /,. the frequency in the rth class- 
interval, so that 'Lfj.==n. 

Then M, the mean, is simply H n ^ ifr^r)y where 

the summation extends over all the class-intervals. This evidently 
gives a weighted mean value of x^,. 

The calculation of M can obviously be effected from this formula 
whatever the size of the distribution and wherever in the range the 
mean happens to be. If however the values of x are large (such 
as measurements in a small unit) and the frequencies are also 
numerous, a change of unit and origin will simplify the arithmetic 
considerably. 

Thus, if a is the value of the temporary origin and d,. is the 
deviation of Xj. from a, then 

Xf. = a-{-dr. 

Now 

= l^[fr{a + d,)] 

= ‘[an + S(/A)] 

= ^ + ^S(/A). 
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and if a be suitably chosen, i.e. so that the deviations of x from a 
are small and of different signs, it is evident that the calculation of 
S {frSj) is far less troublesome than that of S (/r^r)* 

Example i. 

Calculate the mean age of wife corresponding to the age-group 60- 
of husband from the 1921 Census of married couples. (See Chap. X, 
paragraph 10.) 

The first point to note is that the class-interval is five years, so that 
the age of wife must be assumed to be the mid-value of the range. The 
data have been tabulated and a glance at the figures suggests that a 
suitable origin will be the mid-value of the group 55-. The method of 
calculation in the table is self-explanatory. 


Age (mid-value of 
class range) 

Frequency 

/ 

Deviation from 
origin in terms of 
class-interval 8 
as unit 

ys 

27-5 

I 

-6 

- 6 

32-5 

2 

~5 

- 10 

37-5 

4 

-4 

— 16 

42-5 

10 


- 30 

47-5 

21 

~2 

- 42 

52-5 

53 

— I 

- S 3 

57-5 

126 

0 

-157 

62-5 

163 

4-1 

+ 163 

67-5 

35 

4-2 

+ 70 

72-5 

6 

+ 3 

+ 18 

77-5 

I 

4-4 

+ 4 


422 


+ 255 


Thence 2 (/S) = 255 -157 

= 98 class-intervals 
=490 years, 

^S(/8)=490/422 = 1-15 .... 

••• M=57-5 + i-i5 .... 

= 587 years (say). 


so that 
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An alternative method of obtaining the arithmetic mean, if the 
class-interval be taken as the unit, is based on a single and double 
summation of the column of frequencies. Thus, if the origin is 
taken at one value lower than the lowest value given, the initial 
value of the variable may be denoted by i, the second by 2 and 
so on. 

A first summation gives 2/,=/i+/2+/3+ ... +/„-2+/„_i+/„. 

Summing again from the bottom upwards, 

S(S/r)=l/i + 2/2 + 3 / 3 + ••• +(«- 2)/«-2 + («-l)/n-l + w /« 

r“=n 

= s r/,. 

r«l 


If therefore the original initial value is ky so that the origin is 
at A — I, the mean value is 






An example will make this clear. 


Example 2. 

Find the mean height of the sample of men from the table in para¬ 
graph 2. 

The lowest value of the height is 63 in. Take the origin at 62 in. 


Height in 
inches 

Deviation from 
origin 

/ 

S/ 

63 

I 

2 

100 

64 

2 

2 

98 

65 

3 

2 

96 

66 

4 

5 

94 

67 

5 

12 

89 

68 

6 

19 

77 

69 

7 

16 

58 

70 

8 

16 

42 

71 

9 

H 

26 

72 

10 

7 

12 

73 

11 

2 

5 

74 

12 

3 

. 

3 

.. . 



I 100 

700 
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Then the mean height = 62+ 700/100 = 69 in. 

In a large group it is often simpler to take the origin at k and the 
second summation up to the second line from the top. Thus, in this 
example, the mean height = 63+ 600/100 = 69 in. 

This method is convenient in actuarial calculations where it is 
necessary to find the average age of a distribution consisting of 
comparatively large numbers. 

7 . Calculation of the median* 

The median is the middle variate of the array, and if we were to 
construct a diagram showing the array of the variates of the ob¬ 
servations in, say, the example above, it would be possible to select 
the middle variate. In practice it is generally impracticable to 
construct such a diagram, and a reasonable approximation obtained 
from a table of frequencies is, as a rule, sufficient for most purposes. 

Consider, for example, the table in Example i. There are 422 
cases, so that the median will lie between the 21 ith and 212th cases. 
Now from the frequency distribution it is seen that up to and in¬ 
cluding the class range 50-55 there are 91 observations, while if 
the next group be included there are 217 observations. The median 
is therefore in this last group of 126 cases. It is reasonable to 
assume that the first observation in this group has the value of the 
lower boundary of the class range, the last observation has the 
value of the upper boundary and the 126 observations increase 
uniformly within the range. Since we require the age correspond¬ 
ing to a hypothetical case between the 211th and 212th cases, simple 
proportion will give the median to be 

--, 5(211-5-91) 

126 

the class-interval being 5 years. 

Then the median = 55 + 

= 55+478 ... 

= 59*8 years (say). 
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8 . Calculation of the mode. 

The mode is the value which occurs with the greatest frequency. 
Although in a case where the variates proceed by unit variation it 
may be possible to select at sight the most likely value, this is not 
practicable where the distribution is one which concerns a con¬ 
tinuous variable. Thus in Example i all that can be said is that the 
mode lies in a five-yearly group of which 62-5 is the mid-interval, 
although an examination of the table might suggest that the re¬ 
quired value is nearer the lower limit than the upper one. 

In many cases a good approximation to the mode can be found 
from the following simple formula: 

Let M, and represent the values of the mean, median and 
mode respectively. 

Then = 

In Example i the value of the mode according to this formula is 

^0 = 587-3 (587- 59 - 8 ) 

= 587 + 3-3 

= 62*0 years. 

It should be emphasized that this relation though often very close 
is not exact, and is not applicable to all distributions. Other 
methods have to be adopted in extreme cases, although even then 
a reasonable approximation may often be obtained by drawing a 
smooth curve through the statistics. 

9 . Ogive curve. 

When a rough estimate of the median is required it is often 
useful to use a graphical method. This method is best illustrated 
by an actual example. 

If we take the data from the table on p. 263 we may write them 
down in an alternative form, thus: 
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Height of not more 
than X inches 

X 

No. of cases 

63 

2 

64 

4 

65 

6 

66 

II 

67 

23 

68 

42 

69 

58 

70 

74 

71 

88 

72 

95 

73 

97 

74 

100 


Then, if the numbers so found be plotted against the heights and 
the points joined by a smooth curve, we have an ogive curve. 
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An approximate value of the median will be the abscissa corre¬ 
sponding to the ordinate midway between the 50th and 51st cases, 
i.e. about 68^ inches. 

The mode can also be estimated from the ogive, its position 
being where the slope of the curve is steepest. In many cases 
however the shape of the curve is such that all that can be said is 
that the mode lies between certain comparatively wide limits, and 
in those circumstances no better approximation can be obtained 
from the curve than is evident from a consideration of the statistics 
in their original tabular form. 

10. Probable value and expectation. 

Before we proceed further with the discussion of the mode or 
most likely value, it will be convenient to refer to another aspect of 
the mean. 

Suppose that there are n counters in a bag, each counter having 
a value in shillings. Then, if we are given the number of counters of 
each particular value it is a matter of simple arithmetic to find the 
average value of a counter. All that we need do is to weight the 
number of counters having a particular value by that value, add the 
products and divide by the number of counters. 

Example 3 . 

Find the weighted mean of the following distribution: 

No. of counters (jc) 9 i 2 5 7 

Value of counter (y) 12345 

in shillings 

Total value of counters 

= 9 + 2 + 6 + 20 + 35 

= 72, 

Total number of counters = 2 a? = 24, 

Weighted mean = 'Lxy/liX = 3 shillings. 

We may look upon this problem from another point of view. 
Suppose that we use the word prize to denote the value of a quan- 
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tity to be attained. In general, the attainment of a prize is con¬ 
tingent upon the happening of some event and may therefore not 
always be secured. If the amount of the prize is S and it is obtained 
in^A^ trials out of a total of N in which the opportunity of obtaining 
the prize can occur, then the probability of its being obtained is />. 
Also, the total value of all the prizes is SN and the average value is 
pS, This average value is called the expected value or probable value. 
If we are concerned with the person who is playing for the prize 
the probable value of the prize is called his expectation. In other 
words, expectation is the product of an expected gain in actual value 
and the mathematical probability of obtaining such a gain. It may 
be noted that the gain or prize is not necessarily restricted to 
financial gain, but may be a number of years of life or a number of 
heads in a series of throws. 

Let us consider the problem above in the light of this definition, 
and calculate the expectation of a man who is allowed to draw one 
counter at random from the bag. 

The chance that he draws a counter value one shilling = ; 

the chance that he draws a counter value two shillings = ^, 
and so on. 

His expectation, which is the same as the probable value of his 
draw, is, therefore, 

(2 j) ^ + (A) 2 + ('A) 3 + (m) 4 + (¥4) 5 

__ 9 + 2 + 6 + 20 + 35 
24 

= 3 shillings, as before. 

It is seen from this example that the difference between weighted 
mean and expectation lies solely in the order of the operations re¬ 
quired to obtain the result. It is sometimes more convenient to 
treat such a question as one in which the mean is to be calculated, 
while at other times the second order should be adopted. Thus, in 
the following questions, the expectation is most easily calculated 
by introducing the chances of success and failure. 
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Example 4 . 

A player throwing an ordinary die is to receive £(1/2”), where n is the 
number of throws that he takes to throw the first 6. Find his expectation. 
His expectation 



Example 5 . 

A table is divided into six squares numbered i to 6. A player places 
a coin on a certain square. Three dice are thrown. If the number thus 
backed appears once, twice or thrice, the player receives back his own 
coin together with one, two or three others respectively of the same 
value. In any other event he loses his stake. Does the advantage in the 
long run lie with the player or the “banker”? 

(a) The chance that no die shows the number backed = (|)^. 

(/;) ,, one die shows ,, =3(i)^i* 

(c) ,, two dice show ,, =3i(4)^- 

(d) „ three dice show 

The net expectation of the player 

= W + 2 (c) + 3 {d)-{a)= 
and that of the banker 

^-{b)-2{c)-3{d) + {a) = ,\\, 
so that the advantage lies with the banker in the long run. 

For a stake of a shilling it is easily seen that this advantage is just 
under a penny. 

Here it might be contended that as the player is to receive back his 
own coin in addition to the prize, his expectation should be based on 
respective receipts of two, three or four units instead of one, two or 
three units, as appears in the above solution. His expectation on this 
basis might be argued as being 2 (^) + 3 (<^) + 4 (^“(^) or which 
would show a substantial advantage to the player. Further consideration 
would show, however, that if the return of the stake is treated as a 
profit^ then the laying of the stake must be treated as a payment to the 
banker for the privilege of playing and that it is, accordingly, definitely 
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paid away whether the player wins or loses. His expectation then be¬ 
comes 2 (Z>) + 3 (c) + 4 (t/) — I = — as before. 

11 . In connection with mathematical expectation it is interesting 
to note the celebrated St Petersburg problem, which has been a 
fruitful source of discussion for nearly two hundred years. Briefly 
stated, the problem is this: A coin is tossed until head turns up. 
If head turns up first A is to pay B one unit; if head does not turn 
up till the second throw B is to receive two units; if not until the 
third throw four units, and so on. How much must B pay A 
before the game, in order that the game may be considered fair? 
That is, what is B's expectation? 

The theoretical solution is this: 

At the first trial B*s expectation is x i = 

„ second „ „ (1)^x2 = J. 


„ ;zth „ „ = i 

B*s expectation is therefore 1 + 1 +i+ ... to infinity, and as 
this is a divergent series, it appears that B could afford to pay an 
infinitely large sum before the game starts for his expectation. 

Many explanations of this result have been given by eminent mathe¬ 
maticians, notably by d’Alembert, Bernouilli and de Morgan. An inter¬ 
esting account of the problem with an alternative solution depending 
upon the amount of money that B possesses at the outset is given by 
Whitworth in his Choice and Chance, This solution depends on a 
somewhat arbitrary assumption, and a more practical limitation of B's 
expectation arises from the fact that A's resources, however great, must 
be limited. Poincare (Ca/rw/ des Prohahilites^ p. 42) shows that if A*s total 
assets are 2®, B*s expectation is i + For example, if A possesses 2^® 
—^which is more than a thousand millions— B^s expectation is reduced 
from infinity to 16, and as Poincare drily remarks, this is a considerable 
reduction I 

12 . It is essential to distinguish between the absolute probability 
of an event, the average number of times that it may happen over 
a series of trials and the most probable number of times that it will 
occur. The most probable value (or mode, in a statistical distribution) 
is the value that occurs with greatest frequency, and in simple 
examples this value can be easily determined by considering 
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separately the contingency of each event. I'hus in a single throw 
with two dice the numbers that may turn up and the chances of 
occurrence of these throws are given in the following table: 

Possible numbers: 23456789 10 ii 12 

Chances of occurrence: 12345654 3 2 i("^36) 

The most probable number to be thrown is therefore 7. 

Again, in Example 7 on p. 272, a hasty conclusion would be that 
half the pack (i.e, 26 cards) would have to be turned up before 
two aces out of the four would appear. This is not so, however. 
The four aces divide the remaining 48 cards into 5 groups. If a 
large number of such divisions were made the average number in 
each of the 5 groups would be The average number of cards 
which must be turned up before two aces appear is therefore 2 of 
these groups plus 2 aces, i.e. -^^ + 2 or 21*2, as on p. 272. The 
solution there given shows that 26 is neither the average number nor 
the most probable number, and, further, that there is a definite 
probability associated with each number of cards turned up. At 
the risk of labouring the point, it must be emphasized that these 
separate probabilities are only to be realized over a long series of 
trials. 

Example 6. 

A purse contains two half-crowns and three shillings; a second purse 
one half-crown and four shillings. A coin is drawn from the first and 
placed in the second, and then a coin is drawn from the second and 
placed in the first. Assuming that the chance of drawing a half-crown 
is twice that of drawing a shilling, find the most probable value of the 
coins in the first purse after the second operation. 

At the end of the second operation the first purse may contain: 

(i) 2 half-crowns and 3 shillings; or 
(i^ 3 >> >>2 >> 

(iii) I half-crown ,, 4 ,, 

The respective chances are found thus: 

(i) In order to achieve this result, a coin of the same value must 
be taken from the second purse at the second draw as was placed in this 
purse as the result of the first draw. 

Since the chance of drawing a half-crown is twice that of drawing 
a shilling, the chance that a half-crown is drawn from the first purse 
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originally is The second purse will then contain 2 half-crowns and 
4 shillings, and the chance of drawing a half-crown from this purse is f. 
The total chance that a half-crown is drawn on both occasions is 
therefore f .f. 

Similarly, the chance that a shilling is drawn both times is y.y. 

The total chance that the coins in the first purse have the same value 
at the end of the two operations is therefore 

1 5 _ 29 

7.8 -r 7 • "7 — '4 9 • 

By proceeding similarly, it can easily be shown that the respective 
chances under headings (ii) and (iii) are and . 

The greatest of these values is . 

The most probable value of the purse after the second draw is the 
value associated with this fraction, namely 8 shillings. 

Note that the probable value of the coins in the purse at the end of 
the second operation (i.e. the expectation of a person drawing the coins) is 

[(||x8) + (4®5X9j) + (f x6i)] shillings, or 7-|| shillings. 

In other words, it would be worth while to give about 75. gd. for the 
purse after the second draw. 

Example 7 . 

Cards are dealt one by one from an ordinary pack (without replace¬ 
ments) until two aces have appeared. Find (i) how many cards (on the 
average) will be turned up, (ii) the most probable number of cards to be 
turned up. 

(i) Since two aces cannot appear until the second trial at the earliest, 
we have 

Chance of success at the second trial = ^^2"-A 
third — 2 A. 3 . — 

ff jy uiiiu ,, •-^•52'51'60 

fourth *—■ o ^^ ^-7 4:. 3 . 

yy yy lOUrin ,, --3 • 6 2 ’ 6 T ' ^0 * 4 9 


and generally 

Chance of success at the xth trial = ^All, 

52.51.50.49 

The average number of cards to be turned up will be the sum of the 
series whose octh term is 

for all values of x from 2 to 50 (since there must be two aces at least in 
the first 50 cards). 
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Summing this by ordinary algebraic or finite difference methods, it 
is found that the average number of cards to be turned up is 21*2, 

(ii) The most probable number to be turned up will be the value 
of X which gives 

(a;-i)( 52 -.r)( 5 i-jc) 4.3 

52.51.50.49 

its greatest value. 

The value for x = r> the value for x~r~i so long as 

(»•- i) (52-^) (51 ( 53 -r) {52-r), 

i.e. so long as + 52^ — 51 > —r^ + S 5 ^~ 106, 

i.e. so long as 55 >3^ 

i.e. r is not greater than 18 (since r must be integral). 

The most probable number of cards to be turned up is therefore 18. 

EXAMPLES II 


I. Find the mean age of the persons in the following table: 


Exact age 

20 

21 22 23 

24 

25 

26 

27 

28 

29 

30 

Number of 
persons 

43 

68 120 150 

150 

130 

80 

90 

95 

81 

70 


2. Calculate the mean and median in respect of the following 
distribution: 


Length in 
inches 

No. of cases 

Length in 
inches 

No. of cases 

57-58 

2 

65-66 

641 

58-59 

4 

66-67 

532 

59-60 

50 

67-68 

316 

60-61 

100 

68—69 

124 

61-62 

169 

69-70 

72 

62-63 

543 

70-71 

20 

63-64 

762 

71-72 

5 

64-65 

750 

72-73 

I 


3. The infant mortality rates for a certain year in the aggregates of 
the urban districts of the 61 administrative counties of England and 
Wales were as follows: 

30, loi, 100, 53, 57, 72, 87, 48, 63, 70, 49, 58, 79, 94, 50, 98, 69, 56, 
100, 55, 71, 84, 71, 95, 114, 85, 61, 108, 64, 76, 99, 63, 68, 97, 80, 68, 

18 


F M A S 
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90, 7 S. 65. Ill, 90, 66, 6 s„85, 84, 82, 96, 99, 67, 71, 71, 91, 102, 80, 
89, 1 13, 74 , 87, 79, 88, 73. 

Discuss methods of setting out more clearly the information conveyed 
by these figures. Plot a frequency diagram and determine graphically 
the mode. 

4. The following table shows the marks obtained by loo candidates 
in an examination. Calculate the mean number of marks, and ascertain 
as accurately as you can the position of the median: 


Marks obtained 

Number of 
candidates 

1-5 

I 

6-10 

2 

11-15 

6 

16-20 

10 

21-25 

II 

26-30 

15 

31-35 

16 

36-40 

i 15 

41-45 

' 9 

46-50 

7 

51-55 

4 

56-60 

; 3 

61-65 

■ I 

100 


5. Estimate the mode from the data in Question 4. 

How would your estimate be affected if you were given in addition the 
following information? 


Marks obtained 

Number of 
candidates 

25 

3 

26 

2 

27 

I 

28 

6 

29 

3 

30 

3 

31 

4 

32 

4 
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6. Find, by drawing a frequency curve, the approximate values of 
the median and mode from the given data: 


Value 

•5 i-S 2-5 3-5 4-5 5-5 6-5 7-5 8-5 9-5 10-5 11-5 

PVequency 

II 17 26 38 56 83 120 163 196 181 93 16 


How would you check your results? 

7. The following are the death rates per thousand per annum of two 
towns in a certain year: 



Town A 

Town B 

Ages 



Death 



Death 

Population 

Death.s 

rate per 

Population 

Deaths 

rate per 




thousand 



thousand 

0-2 

3,000 

192 

64*0 

5,000 

300 

6o-o 

2-10 

10,000 

70 

7-0 

12,000 

78 

6-5 

10-20 

10,000 

40 

4*0 

10,000 

38 

3-8 

20~6o 

32.500 

260 

8-0 

25,000 

190 

7-6 

60 + 

8,500 

510 

60*0 

8,000 

460 

57-5 

All 

0 

0 

0 

0 

1072 

i 6-75 

60,000 

1066 

1777 


In each group the death rate of Town A is greater than that of 
Town B, but the reverse is the case when all ages are grouped together. 
Explain clearly why this is. 

8. The exports of a certain country during a period of 22 years were 
as follows, in millions: 


Year 

Exports 

Year 

Exports 

Year 

Exports 

1904 

200 

1912 

205 

1919 

207 

1905 

198 

1913 

205 

1920 

208 

1906 

197 

1914 

202 

1921 

200 

1907 

201 

1915 

199 

1922 

195 

1908 

202 

1916 

195 

1923 

193 

1909 

200 

1917 

197 

1924 

200 

1910 

195 

1918 

203 

1925 

208 

1911 

199 

— 

— 

— 

— 
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A writes to the paper to point out that trade is diminishing and gives 
the following figures: 

Average exports 1909-11 

1915-17 £^97 

1921-23 £ig 6 

B replies that it is increasing, and gives the following figures: 

Average exports 1906-8 £200 

1912-14 £204 

1918-20 £206 

Write a short letter commenting upon the letters and pointing out the 
true significance of the figures. 

9. What is the difference, if any, between weighted mean and expecta-- 
iton? Illustrate your answer by an example. 

10. Define carefully probable value and most probable value. 

A purse contains four half-crowns, three pennies and two shillings. 
Four coins are drawn at random. How many different sums can these 
amount to, and what is the most probable sum? (Assume that any one 
coin is as likely to be drawn as any other.) 

11. A coin is tossed until both head and tail have appeared twice. 

(1) On the average how many times will the coin have to be tossed? 

(2) What is the most likely number of throws? 

(3) How many throws must a man be allowed if the odds in favour 

of success are to be 7 : i ? 

12. Two persons throw an ordinary die alternately, and the first who 
throws 6 is to receive eleven shillings; find their expectations. 

13. There are eleven tickets in a bag numbered 1,2, 3, ... ii. A man 
draws two tickets together at random and is to receive a number of 
shillings equal to the product of the numbers he draws; find the value 
of his expectation. 

14. A bag contains eighteen exactly similar counters. Ten counters 
are each of value a, four are each of value b and the remaining four are 
each of value 2b. A man draws two counters at random. It is equally 
likely that any counter will be drawn. Find the ratio of a to by if the 
value of the man’s expectation is to be 4^/3. 

15. Each of two bags contains m sovereigns and n shillings. If a man 
draws a coin out of each bag, is he more or less likely to draw two 
sovereigns than if all the coins were in one bag and he drew two 
coins? 
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16. Purse A contains six shillings and two sovereigns, purse B seven 
shillings and one sovereign. Seven coins are transferred from A to B 
and then seven coins are transferred from B to A, Which purse is now 
likely to be the more valuable? 

17. A bag contains thirteen counters marked with the squares of the 
first thirteen natural numbers respectively. 

(1) A man draws a counter and is to receive the number of shillings 

equivalent to the number on the counter. Find his expectation. 

(2) If the man is allowed to draw three counters and to reject the 

highest and lowest, find his expectation. 

18. A purse contains five half-crowns and four shillings. A pays 
55. 2d, for the right to receive the value of three coins drawn at random. 
Criticize his bargain, and find the chance that, after two attempts, the 
second on the same terms as the first, he will be a winner. 

19. A bag contains m white balls and two red balls. The balls are 
drawn from the bag one at a time without being replaced until a red ball 
is drawn. If i, 2, 3, ... white balls are drawn, A is to receive i^, 2^ 3^, ... 
shillings respectively. Find his expectation. 

20. A bag contains twenty shillings and three sovereigns. Coins are 
drawn in succession, one at a time, without being replaced, until two 
sovereigns have been drawn. What is the probable number of shillings 
left in the bag? 

21. There are ten counters in a bag marked with consecutive numbers. 
Two counters are drawn from the bag. If the sum of the numbers 
drawn is odd, a man is to receive that number of shillings; if it is even, 
he is to pay that number of shillings. Find the man’s expectation 

(1) if the counters are marked from o to 9; 

(2) if they are marked from i to 10; 

(3) if they are marked from 2 to ii. 

22. A bag contains a coin of value M, and a number of other coins 
whose aggregate value is m. A person draws one at a time till he draws 
the coin of value M. Assuming it is equally likely that any particular 
coin is drawn, find the value of his expectation. 

23. A bag contains twenty white balls numbered i to 20 and ten 
unnumbered red balls. A ball is drawn at random and replaced, six 
times. Find the probability that 

(1) at least three white balls are drawn; 

(2) three white and three red balls are drawn; 

(3) three red balls and Nos. i, 2, 3 of the white balls are drawn. 

What is the most probable number of white balls drawn, and what is 

the probability that this number is drawn? 
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24. A man throws a six-faced die until he gets an ace. He is to receive 
£i if he succeeds at the first throw, lo^. if he succeeds at the second 
throw, 6j. Sd. if he succeeds at the third throw and so on. 

Given log 2 = -3010300, 

log 3 = '4771213, 
log 2718282 = -4342945. 

find the value of his expectation to the nearest penny. 

25. A and B have each two ordinary cubical dice, the faces being 
numbered from i to 6 on each die. They throw simultaneously until the 
whole of a stake of £100 has been divided on the following conditions: 

(i) If ^ throws 7 before i? has thrown 2, A is to take the balance 

of the stake then remaining. 

(ii) If B throws 2 before A has thrown 7, B is to take the balance 

of the stake then remaining. » 

(iii) If simultaneously A throws 7 and B throws 2 the balance then 

remaining is to be divided equally between A and B, 

(iv) If at the nth throw neither 7 nor 2 has yet been thrown by A 

or B respectively, B is to take ijn of the balance then re¬ 
maining, except that this additional benefit to B does not 
operate at the first throw. 

Find to the nearest penny the respective expectations of A and B, 
Given log 6 = -7781513, log 41 = 1-6127839, log e = -4342945. 



CHAPTER XII 


FURTHER PROPOSITIONS IN 
STATISTICS 

1. In the preceding chapter we have defined three different 
averages of a distribution, and any one of these three averages may 
be used to typify the whole distribution. Thus, the mean gives an 
indication of the magnitude of the variates, the median tells us the 
position of the middle variate and the mode gives us the most 
frequent value. But these averages tell us very little about the dis¬ 
tribution itself. For example, if the sums assured under whole-life 
policies were in question and we were given a random sample of 
loo such policies, the mean of the sums assured might be ^(^276, the 
median ^(^265 and the mode £250. From the mean, all that we 
could gather would be that the average sum assured was out 
of a total of £27,600; from the median that there were as many 
policies with sums assured less than £265 as there were policies 
with sums assured greater than this amount; and from the mode 
that there were more policies for £250 than for any other individual 
amount. What we require to know in addition is the extent to which 
the individual sums assured vary from the particular average. The 
statistical term for this variation is the dispersion of the group. 

2 . The range. 

The range of the distribution—i.e. the difference between the 
extreme variates—is a simple measure of dispersion. It would help 
us to some extent if we knew that in the example above the lowest 
sum assured was £100 and the highest £500; we should then know 
that the range was 3^400. This measure is however not of very much 
value, for it would not add appreciably to our knowledge of the 
distribution if it happened that there was only one policy for £100 
and only one for £$00, Another sample which gave the same 
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figures for the three averages might well have a smaller range, say 
from ;£i5o to ;£450, notwithstanding that the most important 
frequencies, at the centre of the sample, were not very different 
from those at the centre of the first distribution. It is essential 
therefore that a more accurate and stable measure of dispersion be 
chosen. The range does not depend upon all the variates and the 
measure that is likely to be most useful is one which, as far as 
possible, does satisfy this condition. 


3 « Mean deviation. 

If any particular average were taken and the deviations of the 
variates from this average were set down, it is obvious that the 
signs of some of the deviations would be positive and others 
negative. It might happen therefore that the sums of the deviations 
would be a small positive or negative quantity, or, in certain cases, 
zero, notwithstanding that the individual deviations were con¬ 
siderable. Consequently, if regard were had to sign, the algebraic 
sum of the deviations of the variates from, say, the mean would be 
of very little help as a measure of dispersion. As a result, in finding 
the total deviations from an average we ignore the signs of the 
individual deviations and treat them all as positive. We can then 
find the mean deviation^ irrespective of sign. 

The mean is the average which lends itself most easily to arith¬ 
metical and algebraic treatment, and, as a rule, the mean is chosen 
as the origin from which to calculate the mean deviation. The mean 
deviation may, however, be found from any arbitrary origin. The 
following example gives a simple method for the calculation of the 
mean deviation from the median. 

Example l. 

Calculate the mean deviation from the median of the distribution in 
Example i of Chapter XL 

The median is 59*8 years. Take the same origin as before (57*5) and 
assume that all the variates are concentrated at the mid-point of the 
group in which they lie. (See Note on p. 281.) 
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— a = 59-8 — 57*5 = 2*3 years = *46 class-intervals 


Age 

Frequency 

/ 

Deviation from the 
median in terms of 
class-interval 

8 as unit 

/s 

27-5 

I 

6*46 

6-46 

32-5 

2 

5-46 

10*92 

37-5 

4 

4-46 

17-84 

42-5 

10 

3-46 

34-60 

47-5 

21 

2*46 

51-66 

52-5 

53 

1*46 

77-38 

57-5 

126 

.46 

57-96 

62-5 

163 

•54 

88*02 

67-5 

35 

I-S 4 

1 53*90 

72-5 

6 

2-54 

15-24 

ITS 

1 

3-54 

3-54 


422 


417*52 


Mean deviation from the median = = *9894 ... class-intervals 

422 

= 4*95 ... years. 

Note. Strictly speaking, the assumption that all the variates are con¬ 
centrated at the mid-points of their respective groups should not be 
made for the group in which the origin lies. In this example, however, 
since the median is almost at one end of a group the result given above 
is virtually correct. The extreme cases arise when the mean deviation 
is required from the middle of a large group. In that event special 
treatment is necessary for this group. 

4 . Root-mean-square and standard deviations. 

The difficulty that some deviations are positive and some 
negative would be obviated if the deviations were squared. Then, 
by summing the squares, dividing by the total number of observa¬ 
tions, and taking the square root of the result, a form of mean 
deviation would be obtained which would be independent of the 
signs of the individual deviations. This measure of dispersion is 
called the rooUmean-square deviation. We shall denote this by the 
abbreviation r.m.s. deviation. 

If s be the r.m.s. deviation and the deviation of the frequency 
/, from the origin, then 
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The r.m.s. deviation possesses all the qualities of a good measure 
of dispersion; it is based on all the data, it lends itself to algebraic 
treatment and, in general, it is less affected by fluctuations of 
sampling than other measures. 

It has be^n shown that to calculate the mean without the use of 
a temporary origin may involve a considerable amount of arith¬ 
metic. This is even more marked in the case of the r.m.s. deviation. 
A formula similar to that obtained for the calculation of the mean 
from a temporary origin can, however, be found for the easier 
calculation of the r.m.s. deviation. 

Let a be a temporary origin and d be the deviation of the mean 
M from this origin. 

Let be the deviation of observation from a so that 

^ " CL ““ Xjt " 4 ” d^ 

where is the deviation of Xj. from the mean. 


Then f^S/=f^x/^+2f^^+f,d^, 


Now is the sum of the deviations, positive and negative, 

from the mean, and is therefore zero. (See paragraph 6.) 


i.e. 


or 

where 


S^ = a^ + d^ 


In order therefore to find ^ we may calculate the sum of the squares 
of the deviations from an arbitrary origin, so long as we know the 
deviation of the mean from that origin. 

If now the mean is taken as the origin, then rf=o, and the r.m.s. 
deviation is a minimum. In these circumstances, the r.m.s. deviation 
is called the standard deviation. The standard deviation is the 
measure of dispersion that is most commonly used in statistical 
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investigations, and supersedes for all practical purposes the r.m.s. 
deviation, which depends on an arbitrary origin. 

The recognized abbreviation for the standard deviation is s.d. 
and its value is generally represented by a. 

Example 2. 

Calculate the standard deviation of the distribution in Example i. 
The method of calculation is as follows: 


Age 

Frequency 

/ 

Deviation from 
mean 

8 

ys 

/s* 

27-5 

1 

-*r, 

- 6 

3 ^> 

32-5 

2 

-5 

— 10 

50 

37-5 

4 

“4 

— 16 

64 

42-5 

10 

-3 

~ 30 

90 

47-5 

21 

-2 

- 42 

84 

52-5 

53 

— 1 

- 53 

53 

57-5 

126 

0 

-157 


62-5 

163 


+163 

163 

67-5 

35 

4-2 

4- 70 

140 

72-5 

6 

+ 3 

4“ 18 

54 

ITS 

I 

4-4 

+ 4 

16 


422 


+ 255 

750 


2/rV = 750 and E/,. = 422 . 

Therefore 5^ = 750/422 = 1*777, 

d = ^fAI'^fr = {255 - I57)/422 = -232. 

and C7 = I • 31 class-intervals 

= 6*56 years. 

5 . The standard deviation gives not only, as already explained, a 
general idea of the dispersion or spread of the observations, but 
also a definite numerical limit to the frequency of extreme cases, 
i.e. those in which the variate is much larger or much smaller than 
the mean. 
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We shall prove the following proposition: 

In a set of measurements, Xj^ whose mean is let the 

s.d. be cr, so that 

No^ = sum of squared deviations == S (x,. — 

Then the proportion of cases in which x — x (the deviation from 
the mean) is numerically greater than Ao- cannot exceed i/A^: i.e. 
there cannot be more than NjX^ cases in which x is either greater 
than S+Aa or less than x — Xa. 

Suppose that the actual proportion is p, so that there are pN 
such cases. In each of such cases the squared deviation (x — x)^ is 
at least (Aa)^, so that the total squared deviations, arising from these 
cases, is at least pNX^a^. But this partial total cannot exceed the 
full total, Na^y derived from all the observations. 

Therefore pNX^cr^ > 

or /)>i/A2. 

Thus, if <7 is small the observations will be heaped up round the 
mean, and their “precision” is great; while if a is large the ob¬ 
servations will be widely spread, and their precision is small. 

It may be added that in the common case of a frequency dis¬ 
tribution (such as that of Example 2) which starts with small values, 
rises to much larger ones, and then falls to small ones, the actual 
value of p is usually much under the limit i /A^. 

This proposition is a simple case of TchebychefEs theorem. 
[Cf. W. F. Sheppard, J,LA. vol. liii, p. 82.] 

6. It is a simple matter to prove that the weighted algebraic sum 
of the deviations of a variable quantity from the mean is zero. 

For, if a be the origin, /,, the frequency of an observation Xy ., 
then S/^,/S/„ so that =MS/,. 

Also, if Sf is the deviation of AT, from a, 

^frXr =S/, {a + <?,), since =AT, — a, 

=aS/,+S/, 5 ,, 
i.e. MS/,=flS/,+S/A 

and M-a=S/A/S/,. 

If therefore the mean be taken as the origin, M=a and S/,<y,=o. 
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7. We have shown that the r.m.s. deviation is a minimum when 
measured from the mean. It is instructive to show that the mean 
deviation is a minimum when measured from the median. 

Let the frequency distribution be /i,/2,/3,/4, - -/rj -/n with 
class-interval a, and let S/^ = n. We may illustrate the distribution 
graphically, thus; 



Let be the mean deviation from the temporary origin X 
distant c {< a) from the beginning of the class range m to m + i. 

Then di=^{f„c+f^_i{a+c)^-f^_i{2a + c)+ ... +/i(»?-ifl + c) 

+/m+i (« - d) +/„+2 (2fl - c) + ... +/„ {n-ma + c)}. 
If now the origin is changed to m the mean deviation is where 


‘^2 = ^{/mO+/„-l«+/m-22«+ ... +/j(w-ia) 

n 

... 4 *^ (w ~ W«)}. 

= ^ ••• +/l^ 

(/m+1 ^ "I* fm+2 ^ • "i" fn^)} 


I 

n 


n \ r ^ m 

n+l J « U 1 1 . 


= ~i2M-n), 
n 


m 

where 

1 

d^=d^+^{n-2M), 

which <^1 if M>^n, i.e. if the median lies to the left of f^. 
Therefore, if n is even, the mean deviation is constant for all 
positions of the origin between the Jwth and (iw + i)th observa¬ 
tions, and this value is the least value. If n is odd, d is least when 
the origin coincides with the ^ (n-hi)th observation. That is, the 
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mean deviation is least when measured from the median, or if the 
median is indeterminate, from an origin in the class range in 
which the median lies. 

8. Semi-inter-quartile range. 

We have seen that, in an array of variates, the observation which 
is such that as many observations lie on one side of it as on the 
other is a particular form of average, i.e., the median, which can be 
used to obtain a measure of dispersion. There are certain observa¬ 
tions other than the median which divide the array and which may 
also be used to obtain such a measure. The observation which is 
such that one-quarter of all the observations lie below it in magni¬ 
tude is called the lower quartilcy while that observation which is 
such that one-quarter of all the observations lie above it is called 
the upper quartile. In the example on p. 255 the lower quartile is 
the height of the (hypothetical) man between the tenth and eleventh 
man, and is therefore 5 ft. 6 in., while the upper quartile is the 
height of the (hypothetical) man between the thirtieth and thirty- 
first man and is therefore 5 ft. 8 in. 

The quartiles, like the median, can be estimated by first difference 
interpolation from frequency distributions, or from a consideration 
of the ogive curve, and the student will find it a useful exercise to 
obtain the quartiles from the examples given in this and the 
preceding chapter. 

From the definitions given above we see that if and are the 
lower and upper quartiles respectively, one-half the total frequency 
lies between and . This range, namely — is called the 
inter-quartile range, and it is an even chance therefore that an 
observation taken at random will lie within this range. 

A simple measure of dispersion, and one which can be calculated 
with little difficulty, is the quartile deviation, jj, such that 

It is worthy of note that where the distribution is symmetrical, 
or nearly so, it is generally found that Q is approximately equal 
to §a. 
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The quartile deviation is generally called the semi-inter-quartile 
range and is an approximate method of estimating the dispersion 
of a frequency distribution. 

9. Relative measures of dispersion. 

The measures of dispersion described in the previous paragraphs 
are all absolute and relate to the individual distributions only. In 
order to compare the dispersion between two different distributions 
we need a measure of relative dispersion. Thus, the mean of a 
sample of sums assured may be ^(^317 and the standard deviation 
3^10, while the mean of another sample may be §620 and the 
standard deviation $30. It does not follow, however, that the 
variability in the second case is greater than that in the first. We 
cannot compare the variability of the two distributions without 
eliminating the absolute values of the measures of dispersion. To 
obtain a relative value we express the particular measure of dis¬ 
persion as a ratio of the average on which the dispersion is based. 
Thus, to compare two or more standard deviations we calculate the 
standard deviation as a percentage of the mean, and obtain thereby 
the coejficient of variation. If, therefore, a is the standard deviation 
and M the mean, the coefficient is looajM for each distribution. 

The following is a simple example involving the use of the co¬ 
efficient of variation. 

Example 3 . 

In order to pass a certain examination candidates must obtain at least 
40 marks. All successful candidates with less than 50 marks are placed 
in the third class. The distributions of those who passed in the third 
class in two successive years being given in the table below, find which 
was the more consistent set: 


Marks 

40 

41 

42 

'43 

44 

45 

46 

47 

48 

49 

No. of successful 
candidates in the 
first year 

I 


5 

10 

4 

I 

4 

5 



No. of successful 
candidates in the 
second year 




7 


3 

2 

1 _! 

6 

6 

2 




288 ELEMENTARY STATISTICS 

The more consistent set may be regarded as the one for which the 
coefficient of variation is the smaller. (See, however, Note below.) 


No. of 

First year 

Second year 

marks 

/ 

8 

/8 


/ 

a 



40 

I 

-3 

- 3 

9 

— 

— 

— 

— 

41 

— 

-2 

0 

0 

— 

— 

— 

— 

42 

5 

-1 

~ 5 

s 

— 

— 

— 

— 

43 

10 

0 

- 8 

0 

7 

“4 

-28 

II 2 

44 

4 

+ i 

~h 4 

4 

— 

“3 

0 

0 

45 

1 

+ 2 

+ 2 

4 

3 

-2 

- 6 

12 

46 

4 

-|- 3 

+ 12 

36 

2 

— I 

- 2 

2 

47 

5 

4 - 4 

+ 20 

80 

6 

0 

-36 

0 

48 

— 

— 


— 

6 

+ i 

+ 6 

6 

49 

— 

— 

— 

— 

2 

+ 2 

+ 4 

8 


—- 

~ 

CO 

CO 

I — 

~ 

— 

+ 10 

— 


30 

— 

30 

138 

26 

— 

— 26 

140 


First year : M = 43 + 30/30 = 44, 

cr=v 138/30 — 1 =V 108/30 =V 3*6 = 1-897. 

Coefficient of variation == 189-7/44 = 4*36. 

Second year: M=47 —26/26 = 46, 

<7 = V 140/26 — I = V 114/26 =V' 4*3846 = 2*094. 

Coefficient of variation = 209-4/46 = 4*47. 

The second coefficient of variation is larger than the first, so that the 
more consistent set was that in the first year. 

Note, Care should be taken in using the coefficient of variation as 
a measure of consistency. It might not be safe to use it if, for example, 
given two sets of frequencies, 2/^. and , /^ = ^ constant, for all 

values of r, 

10 . Corresponding relative measures for other absolute measures 
may be formed by dividing by the corresponding averages. The 
mean deviation — gives a relative measure of 

S — while the coefficient of variation for the quartile 

deviation may be taken as \ (0s~0i)/(03+0i)- 
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^11. Standard error and probable error. 

Before the characteristics of different distributions can be com¬ 
pared by means of calculated statistical measures, certain considera¬ 
tions must be borne in mind. One distribution may be a random 
sample drawn from a large parent population, and it is to be 
expected that another sample of the same size would produce 
results conforming to those of the first sample and of the parent 
population, were they available. The results may, however, differ. 
In that case we must find the degree to which they are reliable, or 
alternatively, to what extent the observed differences arise simply 
from errors of sampling and errors of chance. Consider, for 
simplicity, the value of the mean calculated from a sample. This 
value is one of a distribution of means calculated from all possible 
samples of the same size, which lie on a frequency curve of which 
the s.d. is found to be a'j's/n, where a' is the s.d. of all the obser¬ 
vations in the parent population and n is the number of observations 
in the sample. In practice a' is not known, and as an approxima¬ 
tion we use C7, the s.d. derived from the sample. In this case, a/\/w 
is known as the standard error of the mean. 

For reasons connected with the normal curve of error (see p. 290) 
the standard error is often multiplied by *6745 to produce the 
probable error. 

We may explain the use of the probable error in the following 
manner. 

If two values of a variable x (say a and b) are given and it is 
required to find the chance that a value of the variable selected at 
random lies between these limits we may construct a distribution 
curve of unit area, and this curve will be such that the area between 
the ordinates a and b represents the chance required. Distribution 
curves may be constructed either from data given for a question in 
mathematical probability or from statistical data obtained from the 
results of a series of observations. We may take, for example, the 
binomial curve. A simple conception of this curve results from 
calculating by the mathematical definitions the number of heads 
that turn up when m coins are tossed. The relative frequencies are 


F M A S 
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given by the successive terms of the binomial series (| + and 
by plotting tlie relative frequencies against the number of heads we 
obtain a bell-shaped curve symmetrical about the mean. When m 
is very large it can be shown that this curve approximates to a 
curve known as the normal curve of error. 


M 



In Fig. 12, which is in the form of the normal curve of error, 
NM is the ordinate through the mean of the means. PQ and 
P'Q' are the ordinates through points at distances from N such 
that PN and P'N are equal to the probable error of the means. 
Then it may be shown that the area of the curve cut off by PQ, 
P'Q' and PP' is half the total area. Ordinates drawn through 
distances from N equal to twice the probable error would include 
most of the curve, while ordinates through three times this distance 
would include practically the whole curve. 

As a result of this property of the normal curve it is usual in 
practice to say that it is improbable that a statistical measure 
calculated from a sample will differ from the value which would 
be obtained from the parent population by more than three times 
the probable error. Thus, if the difference between two calculated 
means exceeds three times the probable error of the difference, it 
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is assumed that this difference is unlikely to have arisen from errors 
of sampling, but is due to intrinsic differences in the two samples. 
The difference is then said to be significant. 

12. We have already seen that the calculation of mathematical 
expectation can be reduced to the determination of the weighted 
mean. Also, if we have to find the value which occurs with the 
greatest frequency from a consideration of ideal objects—the most 
probable value—we have a parallel in the mode of a statistical dis¬ 
tribution. Further, deviations from the mean in a statistical ques¬ 
tion correspond to deviations from the expectation in a problem in 
probability. 

This parallelism may be extended. Equally likely events have 
their counterpart in equally likely intervals or ranges. Just as 
equally likely events are rare in practice, so ranges are seldom 
equally likely. For example, a simple question in mathematical 
probability is: “Two clerks are in an office: one goes to lunch 
between 12 and i and the other between i and 2. Each takes an 
hour for lunch. Find the chance that they are not out at the same 
time.^’ Unless it be assumed that all times between the given limits 
are equally likely, the problem cannot be solved as it stands. But 
it can hardly be doubted that, in practice, all such times would not 
be equally likely—for a variety of reasons. Similarly, in statistical 
problems there will be certain intervals or ranges which are much 
more likely to occur than others. Consider, for instance, the case 
of a lamp filament. The resistance of such a filament cannot be 
forecast with certainty, and if the resistance is to be, say, 300 ohms, 
it is more probable that the resistance of a filament made for this 
purpose will lie between 300 and 310 ohms than between 390 and 
400 ohms. Again, the average height of the men in a given com¬ 
munity is 5 ft. 7 in. A man chosen at random from the community 
is more likely to be between 5 ft. 6 in. and 5 ft. 7 in. than between 
6 ft. I in. and 6 ft. 2 in. Many such examples occur in statistical 
problems. 


19-2 
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EXAMPLES 12 

I. The following table shows the proportion of male births per 1000 
births in a certain community. Find the mean and standard deviation 
of the distribution: 


Number of male births 
per 1000 births 

Number of cases 

465- 

1 

2 

477 - 

6 

489- 

43 

501- 

399 

513- 

176 

525- 

9 

537- 

2 


2. Calculate the mean age, the median age and standard deviation in 
terms of year of age from the following data: 


Age-group 

Number of cases 

5 - 

20 

10- 

300 

15- 

860 

20- 

435 

25- 

230 

30- 

150 

35 - 

70 

40- 

60 

45- 

35 

50- 

20 

55 - 

10 

6o~ 

7 

65- 

3 


3. Find the values of the upper and lower quartiles. 


Value 

•5 

1-5 

2*5 

3*5 

4*5 

5-5 

6*5 

7*5 

8-5 

9-5 

lo-s 

11*5 

Frequency 

22 

34 I 

52 

76 

112J 

166 

240 

326 

392 

362 

186 

32 


4. Estimate roughly the mean wage from the following data: 
Median 37s., Quartiles 29*5^. and 40*5^. 6 per cent, of the observations 
are less than 20s. per week and 3 per cent, are 455. per week or more. 
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5. Illustrate by a diagram the following distribution: 


Length in in. 

0- 

i~ 

2- 

3 “ 

4- 

5- 

6 - 

7 “ 

8 - 

9- 

10- 

II- 

No. of cases 

I 

1-5 

2-5 

4 

5*5 

8-5 

12 

1 

16 

20 

18 

.i'iJ 



Estimate the mode, median and upper and lower quartiles. Calculate 
the mean, and check approximately the relative values of the mean, 
mode and median. 


6. If in a series of measurements we obtain values of magnitude , 

of magnitude and so on, and if x is the mean value of all the 
measurements, prove that the standard deviation is 

- ^ - 

where x — k + 

Taking k as 200 apply this formula to obtain the standard deviation 
of the exports given in the adjoining table: 



Year 

Exports in 
£ (millions) 

Year 

Exports in 
£ (millions) 

Year 

Exports in 
£ (millions) 

1904 

200 

1912 

205 

1919 

207 

190S 

198 

1913 

205 

1920 

208 

1906 

197 

1914 

202 

1921 

200 

1907 

201 

19^5 

199 

1922 

1 195 

1908 

202 

1916 

195 

1923 

1 193 

1909 

200 

1917 

197 

1924 

200 

1910 

195 

1918 

203 

1925 

208 

1911 

199 


1 

j 



*7. Given the following statistics, find the probable error of the mean: 


Height of man 

No. of 
cases 

Height of man 

No. of 
cases 

5 ft- 3 in. 

2 

5 ft. 9 in. 

16 

5 4 

2 

5 10 

16 

5 5 

2 

5 II 

14 

5 6 

5 

6 0 

7 

5 7 

12 

6 I 

2 

5 8 

19 

6 2 

3 


What use is made of the probable error? If you measured another 
body of men and found that the mean height was 5 ft. 7 in., with nearly 
the same probable error, what conclusion would you draw? 
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•8. Find the probable error of the mean: 


Age-group 

Number of cases 

15- 

5 

20 - 

83 

25- 

157 

30 “ 

162 

35 “ 

147 

40- 

125 

45 “ 

105 

50- 

80 

55 - 

55 

60- 

40 

65- 

22 

70- 

14 

75 - 

4 

80- 

I 


9. Find the average wage, the median and the standard deviation of 
the following distribution: 


Weekly wage 

325. 6d. 

37i. 6d. 

425. 6d, 

475. 6d. 

Number of cases 

4 

3 

6 

10 

Weekly wage 

525. 6d. 

57^. 6d. 

62s, 6d. 

675. 6 d. 

Number of cases 

12 

16 

13 

II 

Weekly wage 

725. 6 d, 

775. 6 d. 

825. 6 d. 

875. 6 d. 

Number of cases 

8 

8 

4 

3 

Weekly wage 

925. 6 d. 

975. 6 d, 

1025. 6 d. 


Number of cases 

I 

0 

1 



10. The scores of two golfers for 24 rounds each are 

A 74» 75» 78, 78, 72, 77> 79» 7^, 81, 76, 72, 72, 77, 74, 70, 78, 

79, 80, 81, 74, 80, 75, 71, 73. 

B 86, 84, 80, 88, 89, 85, 86, 82, 82, 79, 86, 80, 82, 76, 86, 89, 

87, 83, 80, 88, 86, 81, 84, 87, 

Which may be regarded as the more consistent player? 
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II. A sample of the numbers of persons engaged in a certain occupa¬ 
tion is as follows: 


Age last birthday 

Number 

14 and 15 

13 

16 and 17 

35 

18 and 19 

41 

20- 

108 

25- 

176 

35 - 

204 

45 - 

190 

55- 

65 

60- 

39 

65- 

21 

70 and over 

12 


Find the standard deviation. 

12. What is meant by the term “Coejfficient of Variation”? 

During the first 20 weeks of a session the marks of two students 
taking the course were 

X 58, 59, 60, 54, 65, 66, 52, 75, 69, 52, 65, 66, 56, 42, 67, 30, 
50, 60, 46, 48. 

Y 56, 87, 89, 78, 71, 73, 84, 65, 66, 46, 84, 56, 92, 65, 86, 78, 
44, 54, 78, 68. 

Which would you consider was the more consistent? 




CHAPTER XIII 


MEAN VALUE. THE APPLICATION OF 
THE CALCULUS TO THE SOLUTION 
OF QUESTIONS IN PROBABILITY 

1 . It has already been seen that the mean of a number of quantities 
is simply the arithmetic average, weighted if need be. Thus, if there 
are n quantities <^(^2), + ^ ^ + then the 

mean value is 

(f> {(i) ^j) + <^ (^ + h<^ +... + ^ 

n 

Suppose that y=(j>{x) is a function of and that x has the 
n successive values a, a-h 2 /?,... a + {n — 1) h. Then the mean 

value of (l){x) for these n values from x = a to x~a-\-{n—i)h is, 
as above, 

(f> {a)-{-(l>{a + h) + <f>(a + 2 h) +... 4 -(^ + ih) 

n 

Let b=^a + nh so that nh = b — a. 

^ r*-n-l 

Then the mean value = -, D (a + rA) 
nh 

h r-n~l 

b-a r-o 

If varies continuously between a and b so that the number of 
values, w, tends to infinity, the mean value becomes 

T *. h[4>{a) + ^{a-\-h) + {a + n — ih)] 

- ~— - 



It should be noted that, where the function is continuous, the mean 
value depends on the law governing the selected values. For example, 
the mean value of the ordinate of a semicircle determined by ordinates 
passing through equidistant points along the diameter is different from 
the mean value determined by taking equidistant points along the 
circumference. It thus appears that the mean value of a continuous 
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function {x) is not a definite quantity but a quantity varying according 
to the law assumed for the successive values of x. 

This point is illustrated in Example 3. 


2. This application of the integral calculus enables us to solve 
many problems involving mean values. The solution of these 
problems can generally be effected by the use of single integrals, 
although some of the more difficult questions necessitate the use 
of double integration. The three following examples illustrate the 
use of single integrals and, as will be seen, the solutions present 
little difficulty. 


Example i. 

Find (i) the mean value of the ordinate, (ii) the mean value of the 
square of the ordinate of the curve y = a sin nx for the range x — o 

7 T 

to x — ~. 
n 


(i) We have to find the sum of the ordinates over the given range 
divided by the number of ordinates. Since the number of these ordinates 
will tend to infinity as the distance between them tends to zero, we 
shall have tt tt 


M.V.= 


rn rn 

ydx asmnxdx 

Jo _ _ 

TT ir 

rn rn 

\ dx \ dx 

Jo Jo 




a 

— cos nx 
n 


CL CL 

— COS TT 4 -- cos o 
71 n 


TT 

n 


a 

n 


(-1)+ 


a 

n 


TT 


since cos7r= — i 


2a 


TT 


n 
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(ii) Similarly 
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^ ± _ 3 

a 


6 ‘ 

Note. Since we are required to find the mean value of an area, the 
result must be of the second degree in a. 

Example 8. 

A ship steering a straight course at a uniform speed picked up a 
stationary object 2 miles away with the beam of a searchlight, the beam 
making an angle of 45° ahead with the course of the ship. The beam 
was kept on the object until it made an angle of 60° astern with the 
course. Calculate the mean distance of the object during this time. 


o 



Let O be the object and AB the course of the ship. Draw OZ) 
perpendicular to AB. 

Then the lengths of the various lines in the figure are OA = 2, 
AD= \/2j OD= \/2, DB= V^IVZ and OjB = 2\/2/\/3- 

Let P be any position of the ship, distant x from Z), so that OP^ = 2-\-x^- 
Then S, the sum of all lengths such as OP, 

rV2/V3 _ 

= Vz + x^dx 

J.-V2 

= J [xV + 2 4 - 2 log (;v + V X^ 4 - 

— V <6 

which, on evaluation, 

= I + V 2 + log (\/6 + Vs)- 
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/fV2/V3 

M.v, = 5 yJ dx 


f + V2 + log('v/6+^3) 



It will be noted that the variable has been taken as the distance from 
the point Z). If either of the angles POD or OPD had been selected as 
the variable an erroneous result would have been obtained. This can 
be seen by a consideration of the argument below. 


o 



The use of the angle at the vertex O as the variable implies that this 
angle changes uniformly during the progress of the ship. If the ship 
were sailing along the arc WXYZV of the circle round O, equal 
variations of the angle at O would correspond to equal distances along 
the arc, i.e. to a uniform speed of the ship. 

The ship, however, is sailing along AB. 

If the ship were to sail along AB so that the angles at O altered at 
a uniform rate, the ship would have to travel from ^ to jE' in the same 
time as from E to JF’, F to G, and G to B. In that event the ship would 
not be travelling at a uniform speed along AB, 

If the mean value is looked upon as the result of drawing lines from 
O to the base AB at equal intervals and taking the total length of all 
the lines divided by the number of lines when this number is in¬ 
definitely increased, it is obvious that the result when the lines are 
drawn so as to divide the base AB into equal parts will not be the same 
as when they are drawn so as to divide the angle AOB into equal parts. 

Similarly when the angle is taken at the base. 
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3. The use of double integrals. 

Consider first a function, (x), involving a single variable .t. 
Then the mean value of (f> (x) between the limits x = a and x~b is 

x^b 

S^(x)/ Si. 

a:=a / x~a 


If there be a function ifs (x, y) of two variables x and we may 
write similarly 


x=^b 


x^b 


M.v.= S 'Lijj{Xyy) Y, Si, 


where x and y proceed by small but finite intervals. 

If these intervals, say ijii and ijm for x and y respectively, be 
very small, the numerator and denominator of this fraction will 
be very large. 

Multiply both numerator and denominator by (i/«. i/m), so that 
the fraction is 


x^b i/==j 3 

s s 


T I i^—b y—^ j j 

^ Si--. 
nm! x-^ay^x nm 


Replace the small quantities ifn and ijm by Aa: and Ajy, and 
find the limit when A^ and H^y each tend to zero. Then, since the 
limit of a quotient is the quotient of the limits of the numerator 
and denominator (provided that the limit of the denominator is 
not zero), we have 

S S 0 (a:, y) ^y/\x 


M.v. = Lt ■ 


x—b y^ 

S S I AjyAv 


x^a y=^OL 


when A a:, A3; each tend to zero, 
rb 

^{Xyy)dydx 
Ja J a _ _ 

J* j^dydx 


The following examples are illustrative of the method of applica¬ 
tion of double integrals. 



302 


PROBABILITY 


Example 4. 

A rod of length a is divided at random into three parts. Find the 
mean value of the sum of the squares on the three parts. 

Let OP be the rod of length a. Take any point X in the rod distant 
X from O, and another point Y distant 

y from X, The squares on the three p ^ \ ^ j _ p 

segments of the line will be x^y ^ a —■ 

{a~x—yYy respectively. 

Let X be fixed. Then y will vary 
from o to a-Xy so that the total values of the sum of the squares 
OX^y XY^-y yp2 will be 

Y.[x^+y^ + {a-x-yf]y 

where y has every value from o to — ;x. 

Now let X vary. The limits of variation of are evidently from o 
to a. Then 


M.V. : 


•a ca—x 

[x^ -\-y^{a —X- '\’)“] clydx 
Jo Jo __ 

ra ra~x~‘ 

dydx 

Jo Jo 

n a-x 

{2x^ + 2 y^ 4- ^2^ - 2ax — 2ay + 2xy^ dy dx 

0 


I' 


ra ra—x 

dydx 

Jo Jo 

2X^y 4- f y’^ 4- a^y — 2axy — ay'^ + xy‘^ 


dx 


•a ] 

M dx 

• 0 1 

L Jo 


fa 

\2x‘^ (^2 - jc) 4 - § (a - a:)-** 4 - {a — x)~ 2ax {a - x) 

Jo 

— a{a — xY-\-x{a — xY\ dx 


' 0 
. 0 


{a — x) dx 


which becomes, on evaluating the integral. 


3 =i»=- 


2 
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It is important to note that if the sums of the required values can be 
obtained by considering separate sets of the values, the total sum must 
be found and divided by the total of the values. For example, if 
we could best solve a mean value problem by summing </> {x) for all 
values of x varying continuously from o to a, and ijj (x) for all values 
from a to 6, then we must write 


and not 


M.v.- 


• a 

<p (x) dx + 

. 0 J 

\p (x) dx 

a __ 



dx 

'a 


• a r 5 

(ji (x) dx Ip (x) dx 
Jo , j a 


dx 


b 

dx 

a 


The fallacy in the second expression is easily seen when we consider that 


A B 

c^b 


is not necessarily the same as 


A + B 
C + D' 


Example 5 . 

Find the mean value of the distance from one corner of a square to 
any point in the square. 

Let OABC be the square. Take any point X in the side OC distant x 
from O, and draw XM parallel to the side 
CB to meet the diagonal OB in M, Let ^ 

Y be any point in XM distant y from X. ^ 

The length O T = -f-y“ and 

XM = OX=x. 

X 

For a fixed value of x the sum of all values 
of OF, i.e. of will be 

y^x - 

Yi V0 

y—O 

since Y may take up all possible positions 
on the straight line XM, But x may have all values from o to a. 

Therefore the sum of the distances from the corner O to any point 
in the triangle OBC 

a rx _ 

Vx^+y^dydx, 

0 Jo 
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For the sum of the distances from O to any point in the square OABC 
we must double this. The required mean value is therefore 


2 

n 

X 

Vx^-{-y^dydx 

0 

2 

i:j 

ra: 

dydx 

'0 


1 ra - 

“ -o [a/ 2 + log (i -f V2)] dx, 

Jo 

on integrating with respect to y and inserting the limits o and x, 
= ^ [V'2 4 -log(i + V2)]. 


4. Application of the calculus to probability. 

When we are dealing with problems in probability where the 
number of cases involved depends upon magnitudes varying 
continuously over a given range, the method of approach is similar 
to that outlined above for the solution of mean value problems. 
The general principle is to take the quotient of the number of 
favourable ways by the number of possible ways, where all ways 
are equally likely. 

The application of the integral calculus to problems in probability 
is best illustrated by examples; as a general rule it is sufficient to 
employ single integrals, although in some instances it is of advantage 
to use double integration. Many problems can be solved by either 
method, and examples of both methods are given below. 

Example 6. 

A line of given length is divided into three parts by two points taken 
at random. Find the chance that no one part is greater than the sum of 
the other two. 

We shall adopt the method of single integration for the solution of 
this question. 

Let one of the random positions 

be at P distant x from the end A of ^- »—a--1 —b 

the line, and let AC = CB = \a. <-— x---> ® 

(i) Consider the favourable cases 
in which AP( = x) is less than ia, 

Take a point Q in the line such that PQ=^\a, Then, for the conditions 
of the problem to be satisfied, the other random point R must lie in 
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CQ (otherwise PR or RB will be greater than half the line and conse¬ 
quently greater than the sum of the other two parts AP, RB or AP, PR). 

Now P lies in the small part dx between distances x + dx and x 
from Ay and since P has been taken at random, the chance that it falls 
dx 

in this small part is 

a 

The chance that R lies in CQ is 

^ 9 . - ^Q-^C _x + PQ-AC _ X 
a a a a' 

since PQ and AC are each ^a. 

Therefore the total chance that P falls in dx and in C0 is 

X dx 

0 a' a* 

the limits of x being o and la, 

(ii) Consider the favourable cases in ^ " 
which AP{ — x) is greater than la. < 

Then, as above, R must lie in QC, 
the chance of which is 

QC __ PQ — CP _ — x-h^a_^a — x 

a a a a ' 


X ■ 


P 

-> 


Fi^r. 18. 


Therefore, since the limits of x are now Ja and a, the chance that P 
falls in dx and R in CQ is 

a —X dx 


' ia 


a a 


The total chance required 




^^^x dx a —X dx 
0 a' a ^ ' a 


ix^ 


a 

I /I a:^ 

' (P \2 4 


a 

ka 


j r ]< 

0 « L Ji 


Example 7. 

Two points are selected at random on a line of length a. What is the 
probability that none of the three sections into which the line is thus 
divided is less than ^a? 


F M A S 


20 
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As an alternative this question will be solved by the use of double 
integrals. 

Let AB be the given line divided p q 

into four equal parts at C, D, E, Now A -M-•-1—J ^ 

if P and Q be the random points, then € D E 

to satisfy the required conditions Fig. 19. 

neither can be in y 4 C or EB. 

Let P be at distance x from A, Then the limits of x will evidently 
be \a and \a. Let Q be at distance y from A ; then Q can take up any 
position from along the line from P to the point E, As the origin is 
at A, the limits oiy will be \a-\-x and |a. 

Again, when P is fixed, all possible positions of Q will be from P 
to B, i.e. y can vary from x to a. Obviously x can vary from o to a. 

Then by the unitary definition, the required chance is 


a 3^ 
•2 ci ' 


0 4 r2 

dydx 

a , J a 

A 4+'^ A 


4 4 


O a ra 

dydx 

X Jo 

a 

[ [la-ja-jc] 

J a 


dx 

X 

dx 


dx 


ra 

{a —x) dx 
Jo 




fax — ^ax — ix^ 


ax — 


1 

-T6> 


on inserting the limits and simplifying. 

Note. Each of the above two problems can be solved by the different 
methods here demonstrated. The two questions are exactly similar, and 
with the necessary alterations in the limits precisely the same working 
can be applied. 


Example 8. 

Two independent events, A and must each happen once and once 
only in the future. The chances of their happening in the interval from 



ILLUSTRATIVE EXAMPLES 


307 

t to t + dt are proportionate to aPdt and b^dt respectively, where t is 
the time elapsed and a and b are constants (positive fractions). Find 
the chance that the two events happen in the order AB. 

Let the chance that A happens at the moment of time dt be ka^dt 
and the chance that B happens be l¥dt. Then, since the events must 
happen, we have 

00 

ka^dt—\ and 
0 

This gives ^ — log a and 

Now the chance that B happens between now and time t from now 

i 

IbFdx. 

0 

Therefore the chance that B has not happened by that time 

= I — [ Ib^dx, 

Jo 



l¥dt = i. 
0 

/ = — log b. 


The chance that A happens at the moment of time dt 

^kaJdt, 

and the chance that A happens at that moment, B not having happened, 
= {i-^^^U^dx^ka*dt. 

Therefore total chance that the events happen in the order AB 


which becomes 


log a 

log a + log b 


on substituting for k and / and evaluating the integrals. 


Example 0. 

In a certain year A and B were in London for one period only in 
each case, A for one-third of a year, B for one-quarter of a year. As¬ 
suming that in the case of A any one period of one-third of a year 
and in the case of B any one period of one-quarter of a year is as likely 
as any other period, find the probabilities that 

{i) A was in London the whole of the time that B was; 

(2) A and B were not in London at any moment together; 

(3) ^ and B were in London at some moment together; 

(4) A came to London before B, 


20-2 



I 

f 9* 


308 PROBABILITY 

(1) The chance that A arrived in London at point of time between 

t and from the beginning of the year is y, since he must have 

arrived in the first two-thirds of the year. 

The chance that B arrived in the one month permissible = 

Therefore the chance that A was in London the whole of the time 
1 dt 1 

that B was = - o = . 

Jo 9 ^ 9 

(2) The chance that A and B were not in London together = {a) the 
chance that B arrived after A had left + (Z>) the chance that B left 
before A arrived. 

For {a) B must have arrived between times t + \ and i.e. in the 
space of time ^ — The limits of / are o and -/jt. 

5 

f''‘“ IlJlf ^ = 

Jo i I 144 

For {b) B must have arrived between the times o and / —J, and the 
chance of this event, namely that B left before A arrived 

2 

f 3 t-ldt ^ 

li i I 144' 

4 

Total chance under this head = yf. 

(3) This is evidently the complement of (2) and is 

r_2^ or 
1 7 2 7 2* 

(4) If A came to London before B the chance is evidently 


Therefore 


chance; 


-1: 


l-t dt 
0 1 §~ 9 ’ 


5. Geometrical solutions. 

Many of the above types of question can be solved by the aid 
of geometry. Since a definite integral represents an area, the ratio 
of the number of favourable ways to the total number of ways 
when there is continuous variation between the limits can evidently 
also be solved by relating the areas enclosed by parts of curves. 
These curves or parts of curves may take the form of rectilinear 
figures, and in this event the problem can often be solved in a 
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simpler manner by the use of geometry than by having recourse 
to the methods of the calculus. 

Consider for instance Example 7 (p. 305): 

If the straight line be divided at random into three parts 
Xy yy a — x—jy the following must hold to satisfy the conditions 
of the problem: 

x-\-y<a\ Xyyy a — x—y t2ic\\>\a. 

We have 

y>la) 

and a~x—y>\ay 

giving x-{-y<la, 

Y 

N 


B 

O A P M X 

Fig. 20. 

Draw the straight lines LM, x+y — a and NPy x-i-y = ia and 
complete the diagram as shown: 

OM=a = OL; OA = ia; OB = la; AP=\a, 

Then the conditions above may be illustrated thus: 
x>\a means that the points must be to the right of AD ; 
y>\a means that the points must be above BC ; 

X -f jy < fa means that the points must be below the line NP. 

The only points satisfying all these conditions are contained in 
the shaded area. 




PROBABILITY 


310 

Similarly, for the total possible positions governed by the con¬ 
dition x-\-y<a, it will be seen that all possible points lie in the 
triangle LOM, 

The required chance is therefore 


Shaded area _ ^ ^ 

LOM 


as before. 


6. We will conclude this chapter by solving a further problem by 
integral calculus and by plane geometry. The alternative solutions 
by integral calculus are given in order to show that there may be 
more than one method of approaching the question, and the geo¬ 
metrical solution is an excellent example of the application of 
elementary methods to a seemingly difficult problem. 

Example 10. 

X starts between 2.30 and 3 o’clock to walk at a uniform rate of 
4 miles per hour from A through B and C to D. Y starts from D in 
the reverse direction between 2 and 3 o’clock and walks at a uniform 
rate of 3 miles per hour. From ^ to 5 is 2 miles, B to C 1 mile, C to 
D 3 miles. What is the chance that they meet between B and C assuming 
that between the given limits any time of starting is equally likely 

Method (i). 

The chance that Y leaves D between t and t + dt past 2 is He 

^ 60 

arrives at C at ^ minutes past 3, and at B at t + 20 minutes past 3. 
The chance that he meets X between B and C is that X has not reached 
C by / minutes past 3, i.e. that X has not started before ^ +15 minutes 
past 2 and that he has reached B by t + 20 minutes past 3. 

If X has reached j5 by t + 20 minutes past 3, he must have started 
by < + 50 minutes past 2. 

In order to meet, X must have started between (f-f 15) and (jf + 50) 
minutes past 2; the earliest time cannot be before 2.30 and the latest 
after 3. 

Therefore if t is 10 or less, X can start at any time between 30 and 
t + 5o; 

, 1 + 20 

chance =-. 

30 

If t is 10 to X can start anywhere from 30 to 60; 

chance = M. 
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If ^ is 15 to 45, ^ can start anywhere from f+15 to 60; 


Required probability 


chance = 

30 


_ t-{■ zo dt ^ 30 j* 45 -1 

“Jo 30 60 j 10 30 60 ji5 30 60 

= (on evaluating the integrals). 


Method (ii). 

The chance that they miss is the sum of the chances that Y starts 
too early and Y starts too late. 


(a) Y starts too’early. 

The chances that X reaches B between t and t-\-dt minutes past 3 is 
As Y cannot reach B until 3.20, t must be greater than 20. 

f 30 /_ 20 dt I 

Therefore chance that Y is too 


dt 

30’ 


f30 t 

i early = 

J 20 


60 30 36’ 


(b) Y starts too late. 

The chance that X reaches C between t and t-{-dt minutes past 3 

is -- , and the limits of these times are evidently 3.15 and 3.45. 

30 

The chance that Y has not arrived at C by ^ minutes past 3 = - ^ ^ 


Therefore chance that Y is too late = 


bo —t dt 
1 15 60 30 

Therefore the chance that they miss = + 2 == il* 

Therefore the chance that they meet = i - 

— 17 

Method (iii). 


60 


Let the straight line ABCD (Fig. 21) represent the route. 

Draw two straight lines at right angles to AD at A and D respectively 
and choose a suitable unit on both these straight lines to represent an 
hour. Then if X starts from A at 2.30 he would cover the distance AD 
by 4 o’clock; if he starts at 3.0 he would reach D by 4.30. Similarly, 
if Y starts at 2 o’clock he would reach ^ by 4 o’clock; if at 3 o’clock 
by s o’clock. 

If, therefore, we join these points by the straight lines A^D-^\ A^D^ 
and DA^ \ ^4^4, the rhombus KLMN represents graphically the space 
of time over which it is possible for X and Y to meet. 
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Draw perpendicular to AD meeting DA^ and A^D^ in 65 
respectively, and C6\ perpendicular to AD meeting A^D^ and A^D^ 



Fig. ai. 


in q, Cg respectively. Then the space of time over which X and Y can 
meet between the points B and C on their journey is represented by 
the area 


The required chance = 


area h^h^Mc-^c^ 
area KLMN^^ 


By simple geometrical methods the value of this is found to be 


EXAMPLES 13 

1. There are m posts in a straight line at equal distances of a yard 
apart. A man starts from any one and walks to any other; prove that 
the average distance which he will travel after doing this at random 
a great many times is J (m + i) yards. 

2. If two milestones be selected on a straight road n miles long, what 
is their average distance apart? 

3. Two quantities are taken at random from o to a; find, by means 
of the integral calculus, the chance that the greater of the two is less 
than a given value b. 

4. Find the mean value of the reciprocals of all quantities from n 
to 2«. 
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5. A ladder of length / can be safely used, without its being secured, 

77 77 

at any angle to the ground between and . If any angle between 

4 3 ^ 

these limits is equally likely, find the mean vertical height reached by 
the ladder in an infinite number of placings. 

6. OP is a straight line of length p. A fixed point O is taken in OP 
such that OQ = q. Two other points are taken at random in OP. Find 
the chance that they both fall in OQ, 

7. A point is taken at random on a given finite straight line of 
length a. Find the mean value of the sum of the squares on the two 
parts of the line. Find also the chance of the sum being less than this 
mean value. 

8. A semicircle, APB, stands on a base, AB, of length 2r. P is a 
point on the circumference, and AP, BP are joined to form the right- 
angled triangle, APB. Find the mean value of the area of the triangle: 

(1) if P is a point chosen on the circumference at random; 

(2) if P is fixed by choosing a point N at random in the base AB, 

and erecting a perpendicular from N, to meet the circum¬ 
ference at P. 

9. Find the mean value of the ordinate of a semicircle, the points 
along the diameter at which the ordinates are taken being equidistant. 

10. In two opposite sides of a square, whose side is of length a, 
points P and Q are taken at random and are joined by the line PQ 
which thus divides the square into two pieces. Find the mean value 
of the area of the smaller of the two pieces. 

11. Find the mean of the square of the distance of a point within a 
given square of side za from the centre of the square. 

12. A straight line of length a is divided at random at two points. 
Find the mean value of the product of the three segments. 

13. A point is taken at random within the area bounded by y = jc log x, 
the x:-axis and the ordinates x=iy jc = 4. Find the probability that the 
distance of the point from the jy-axis is less than 2. 

14. Three points are taken at random on the circumference of a 
circle. Find the chance that the sum of any two of the arcs thus cut off 
is greater than the third. 

15. A straight line is divided into three parts by two points taken 
at random. Find the chance that none of the three parts is greater than 
five-eighths of the line. 
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16. There are two clerks in an office, each of whom goes out for an 
hour for lunch. One may start at any time between 12 and i o^clock, 
the other at any time between i and 2. All times within these limits 
are equally likely. Find the chance that they are not out together. 

17. Find the chance that the roots of the equation x^ — 2ax + h = o 
are real, where a and b are positive proper fractions chosen at random. 

18. The point M is the centre of a line LMN of length 4a. Two 
points P, Q on the line are chosen at random. Find the chance that 
the sum of the two distances MP and MQ is greater than a, 

19. If on a straight line of length {a-^b) two lengths a and b are 
cut off at random, find the chance that the common part does not 
exceed a length c. 

20. In a line AB of length 3^2, a point P is taken at random and then 
in AP a point Q is taken at random. What is the probability that 
PO exceeds a} 

21. The sides of a rectangle are taken at random each less than 
an inch and all lengths are equally likely. Find the chance that the 
diagonal is less than an inch. 

22. Three points are taken at random on the circumference of a 
circle. Find the probability that they lie in the same semicircle. 

23. Two points are taken at random on a given straight line of 
length a. Prove that the probability of their distance exceeding a given 

length c {<a) is equal to 

24. OA and OB are straight lines of length a at right angles to one 
another. P and Q are points taken at random in OA and OB respectively. 
Find the chance that the area of the triangle OPQ is less than 

25. A starts to walk from X to Y 2 miles apart at 3 miles per hour. 
On the journey he unknowingly drops his handkerchief, but discovers 
his loss when he has covered half the remaining distance. He then 
proceeds to retrace his steps at 4 miles per hour. B starts on the same 
journey at 3 miles per hour 5 minutes after the handkerchief is dropped. 
Find his chance of reaching the place where it was dropped before 
A does. 

Assuming that B picks it up, find the mean distance he would have 
to carry it to restore it to A if the series of events were to take place a 
large number of times. 

26. Find the mean distance between two points on opposite sides of 
a square whose side is unity. 
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1. Find the sum of n terms of the series i, 2, 4, 9, 19, 36, 62, .... 

2. A person writes four letters and four envelopes. If the letters are 
placed in the envelopes at random, what is the chance that not more 
than one letter is placed in its correct envelope.^ 

3. Find the mean age and median age of the following distribution: 


Age 

Population 

1,000’s 

Age 

Population 

IjOOO’s 

20— 

382 

55 - 

148 

25- 

379 

60- 

“5 

30- 

348 

65- 

84 

35- 

316 

70- 

S 2 

40— 

272 

75 - 

26 

45- 

234 

80- 

II 

50- 

194 

85- 

4 


4. Given Mo= 1027, Me= 1212, Mi2= Wi 8 = 20I4, explain 

(i) how you would complete the series from Wq to Wg; 

(ii) how you would proceed if you were asked to complete the 

series from Wg to Wjg supposing that it were unnecessary to 
find Mj, Mg, M3, M4 and Mg. 

5. The faces of a cubical die are marked i, 2, 2, 4, 6, 6. Find the 
chance that, in ten throws, four 2's, two 4's, four 6's are thrown. 

6. Find the tenth term of the series: 

{a) I, 4, 13, 36, 97, 268, 765, 

{b) 2, 12, 36, 98, 270, 768, .... 

7. Show that AMa;t;a! = t;a;AMa. + Ma.^iAt?a5 and hence prove by mathe¬ 
matical induction that 

A^M^. + W(1) At;^. A”~X+1 + «(2) A^x 

8. Explain the mathematical meaning of the word “probability^’. 
If the probability of the happening of an event at each trial be and 
n trials are to be made: 

(1) What does/)« represent? 

(2) What is the probability that the event will happen at least 

m times? 

(3) What is the most probable number of times the event will 

happen? 
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9. Prove that in the process of obtaining divided differences of the 

function given the last divided difference is numerically 

the same whatever the order of the arguments and the corresponding u's. 

10. Show that 

-( 2 ) v( 3 ) 

(1) = , Ao”*h— rA2o^ + ...; 

2I 3I 

v(2) v(3) 

(2) 'Lu^ = C + x^^%-{- Amo + —, AX + ---* 

11. Two Companies A and B make simultaneous issues each of 

1000 bonds. 7 'hose of Company A are redeemable by equal drawings 
spread over 20 years, and those of B by equal drawings spread over 
40 years. Find, in the case of two definite bonds, one of each issue: 

(1) the probability that the bond of Company B is redeemed before 

the bond of Company A ; 

(2) the probability that the bond of Company B is redeemed before 

the bond of Company A and within 15 years of issue. 

12. If tto» ^5? WjQ, be four values of a function at equidistant points, 
find expressions true to third differences for Uq and Wg, solely in terms 
of Wo, «6, «io and w^s. 

13. Explain what is meant by the terms mediany histograrriy ogivey 
illustrating your answers by rough diagrams. 

14. Two throws are made, the first with three dice and the second 
with two. What is the probability both that the first throw is not less 
than 11 and that the second throw is not less than 8 } 


15. Show that the series whose nth term is 


1.3.5 ••• ( 2 ” - 3 ) 

gn-l 




is equivalent to 


^ (^ 05^*+2 "k * * * )• 

16. Obtain the approximate quadrature formula 


fU 

= ^ (27W0 + 17W1 + 5W2 - W3). 

J-i 


17. The numbers of members in a Friendly Society were available 
for the following years: 


Year 1922 1923 1924 1925 1928 

Number of members 995 998 1003 996 976 
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It was desired to obtain estimates for the years 1926 and 1927. This 
was effected on the assumption of a constant fourth difference. Sub¬ 
sequently it was discovered that the numbers for 1926 were actually 
1002, and a fresh estimate for the year 1927 had to be prepared. Calcu¬ 
late the original estimates for 1926 and 1927, and find the revised figure 
for the year 1927. 


18. If 6 and c are positive quantities (h>c) and if 


— be 


have any value between b and all such values being equally likely, 
find the probability that x is real. 


19. Establish the formula for the difference of a product of two 
functions, 

A {u^v^) = 

and deduce the formula for the sum of a product, 

n--l n—l 

x —0 x --^0 


71 - 1 

By means of this formula find the value of E x"^. 

a*— 0 

a+x 

20. If E Mg. = integral values of a, prove that, to the third 

x—a 

order of differences, 

Uy = *2% ~ *008 - 2 W ^ + Wq ), 


Given the following table, find W7, Wjg and 


a 

0 

5 

10 

15 

20 

a +4 

E 

•0427 

•1467 

•2459 

•3408 

•4317 


21. If events Ay B, and C are independent of each other, and events 
E and F are mutually exclusive and are both contingent upon the hap¬ 
pening of A, give an expression for the probability that either E or F 
will happen and that neither B nor C will happen. 

22. The equation 

+ 3 = 15^ 

has a root between i and 2. Obtain it to three places of decimals by 
inverse interpolation. 
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23. The following formulae for approximate integration are correct 
to third differences: 


r+3 

J = l ( 3*^-2 + 2 «o + 3 W 2 )» 

r +3 

Prove that if these formulae are applied to a function whose fifth 
differences are constant, the respective errors involved in the approxi¬ 
mations are in the ratio 7:18, and are in opposite directions. 

By a combination of the two formulae obtain an expression, correct 
f 

to fifth differences, for J Ug,dx, 

24. Define standard deviation^ Find the mean and standard deviation 
of the following distribution: 


Length in cnis. 

No. of cases 

10- 

2 

15 - 

— 

20- 

8 

25” 

II 

3 O” 

12 

35” 

16 

40- 

25 

45” 

17 

50“ 

8 

55- 

I 


25. If third differences of u are constant, express in terms of 

Atti, 

26. If three different whole numbers from i to 30 inclusive are 
chosen at random, what is the probability that their sum is equal to 30? 

27. In each of two adjacent sides of a square a point is taken at 
random. Find the chance that the length of the line joining the two 
points is between \a and a, where a is the length of a side of the square. 

28. Draw the graph of jv = a:®- 3^-I-i. By reference to the graph, 
supplemented by arithmetical trials, find approximately the value of the 
negative root of the equation a:® —3a;+i=o, correct to two places of 
decimals. 
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29. A and B throw in turn with two dice, A having the first throw. 
A is to win either (i) if he throws a double six in his first six throws 
or (2) if he throws a double six before B has a throw scoring 9 or more. 
Find an expression for A's chance of winning. 

30. Find the mean length of a straight line drawn from one of the 
angular points of an equilateral triangle to a point taken at random in 
any one of the sides. 

31. Find an expression, correct to fourth differences, for the value 

of ~ when i, in terms of u^. 


32. Show that, if fourth differences of u are constant, 

(i [5] - 0 (“»+2-“2) = 

33. Prove that 

dx I / I \ a 

Jo 1+^" 4^2 \ oa/ + 

approximately, where a is a positive integer. 

By putting ^2 = 3, obtain the value of tt to three places of decimals. 

34. Three numbers are selected at random, one at a time, from the 
five numbers i, 2, 3, 4 and 5, repetitions being allowed. Find the prob¬ 
ability that the third number selected is not less than the second and the 
second is not less than the first. 


35. By means of the formula (taking « = 10) 


x=7nn n—1 

^ ^a+x ” ^ ^ ^a+nx ~ (^a+mn ^o) 
a :-«0 x =0 2 


n^-i 


12 




CQ ! 

find the approximate value of logjo ^, given that logiQ 2 = *3010, 

logic 3 = •477i» logio 6* = •4343- 

36. Given that Uq^ 16 

«! + Mo = 64 

+ + 266 


+ + + = 1029 

find the values of and m^, on the assumption that A^m^. is constant. 

37. Through two points taken at random in a diagonal of a square, 
two straight lines are drawn parallel to one of the sides and to each other. 
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Find the probability that the area of that part of the square between 
the two lines is not less than one-third of the area of the whole square. 

38. The number of herds of cows in a certain district is as follows: 


Number of 

Number of 

cows in herd 

herds 

I-io 1 

3^1 

11-30 

538 

31-40 

227 

41-50 

199 


Estimate the total number of cows in the 550 largest herds. 

39. A and B play a game in which they toss a coin alternately. If 
the coin turns up head the thrower scores two points; if it turns up 
tail his opponent scores one point. The winner is the one who first 
scores at least seven points. A has scored four points and B two, and it 
is ^’s turn to toss the coin. Find the chance that A will win the game. 

40, In a lottery there have been sold 2500 tickets numbered 0001 
to 2500 and 5000 10^. tickets numbered 2501 to 7500, and it is arranged 
to draw £,$00 worth of tickets, the holders of which will receive 10 times 
the face value of their tickets. The following method of drawing the 
winning tickets has been suggested. Ten counters marked with the 
figures o to 9 respectively are to be placed in a bag. A counter is to be 
taken at random from the bag and replaced; this is to be done four times 
and the figures obtained are to be written down in the order in which 
they are drawn to form a four-figure number. Drawings are to be 
carried out in this way, the tickets bearing the numbers so drawn to 
participate in the prize, until £$00 worth of tickets have been drawn, 
subject to the provision that any drawing giving a number to which 
there is no corresponding ticket or a number which has already been 
drawn or a number which would cause more than £$00 worth of tickets 
to be drawn is to be ignored. 

The following criticisms of the suggested method have been made: 

(i) The holders of the 105. tickets have an advantage over the 
holders of the £i tickets, since although the lo^. tickets cost 
only half as much as the £1 tickets, they have the same chance 
of being drawn. 

(ii) The system by which it is possible for a number to be drawn 
to which there is no corresponding undrawn ticket produces 
inequalities in the chances of the various ticket-holders. 
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(iii) The system of making the drawings is inequitable as it does 
not ensure that the correct proportions of £i and 105. tickets 
will participate in the prize. 

You are asked briefly to consider these criticisms and to advise whether 
the method is fair in all its aspects. 

41. A school contains 330 children, all of whom were born in the 
eleven years 1920-1930. Four times the total number of children born 
in any group of five consecutive years is equal to five times the number 
born in the first and last years of the group plus ten times the number 
born in the middle year of the group. The total number of children 
born in 1924, 1925 and 1926 is three less than three times the number 
born in 1925. Find the number born in 1925. 

42. Show that 

(PUx I /aQ A'i V 

dx^ ~ zh^ ^ ^ ^x~2h) 

approximately, where h is the interval of differencing, 

43. Establish Hardy^s formula “39^'^; and use a Hardy fonnula to 

find the value of (i +oc^)~^dx correct to four places of decimals. 

Jo 

44. A and B play a match of seven games. A'$ chances of winning, 
drawing and losing any game are as 5:3:2. One point is scored for a 
win and half a point for a draw. Find the chance that the match is 
drawn. 


45. Prove that — Wg + ^^3 — • • • = — (i)^ ^“1 + where 

is a real positive quantity which diminishes as n increases, and 
Lt u„ = o. 

n—>00 

In the series i—J + ^ — y4-... find the value of and prove that 
this series is equivalent to the series 


1 

2 


[■ 


I 

F- + 
3 


3-5 3 - 5-7 



46. Use the conception of finite differences to prove that the general 
term in the recurring series UQ-hUiX + U2X^-hUgX^ -h... (scale of relation 
I —px — qx^) is of the form* Aa^ 4 - Bd^, where a and d are functions of 
p and and A and B are constants. 

Prove that every series whose coefficients form an arithmetical pro¬ 
gression is a recurring series, and that the generating function is 

a'\-{d—a) X 


where a is the first term and d the common difference of the progression. 


F M AS 


21 
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47. A bag contains four black balls and eight white balls. Two balls 
are drawn at a time and replaced, this operation being performed six 
times. Calculate the probability that two black balls are not drawn four 
times consecutively. 

48. Obtain a formula for the finite integration of any rational integral 
function of x and apply it to find the sum to n terms of the series whose 
rth term is -h i) (r — 2). 

49 * ^25:30 ~ ^ ^30:30 — 15*7^45 ^25:35= 15*^^^ > 

^86:30 = ^4*42^5 <^25:40=^4*^24; ^30:36 = ^5*209* 

Find as accurately as possible <227:32* 

50. A and B toss a coin in turn, a head counting two and a tail one. 
The winner is the person who first scores a total of exactly three. If 
either tosses a head when his score is already two, his score is reduced 
to one. Calculate ^’s chance of winning the game if he has the first toss. 

51. By successive approximations based on the values of — 5^;: 4 - 3 
when a: = 0, i, 2 and 3, find to two places of decimals the smallest positive 
root of the equation — 5JC 4- 3 = o. 

52. Two points are chosen at random on the circumference of a 
circle of radius r. Find the chance that the length of the chord joining 
them is less than r \/3- 

53. A die whose sides are marked, i, 2, 3, 4, 5, 6 is thrown five times. 
Find the probabilities: 

{a) that the product of the five throws is 432; 

{b) that the sum of the first three throws is exactly three more 
than the sum of the last two throws. 

54. Complete the series to Mjg by means of Everett’s formula: 

X -s o 5 10 15 20 25 

6i*o 91*4 113*6 134*2 179*4 238-0 296-2 

55. The probability that A will die within ten years is *2 and the 
probability that A, B and C will all be alive ten years hence is *42. The 
probability that at least one of the three will be alive ten years hence 
is *985. Find the probability that A and B alone will be living at the 
end of the tenth year. 

56. Explain clearly the meaning of the term dispersion in statistics. 
What are the most usual methods of measuring dispersion? Indicate 
the advantages and disadvantages of these methods. 
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57. A and B throw for a certain stake, each throwing with one die; 
die is marked 2, 3, 4, 5, 6, 7 and B*s i, 2, 3, 4, 5, 6. Prove that A's 

expectation is 47/72 or 21/31 of the stake, according as equal throws 
divide the stake or go for nothing. 

58. Let (f> (xi)y 0 (^2)» ^ (^n) be the values of a function </> (x) 

corresponding to n equidistant values of x distributed over the range 
h-a. Define the mean value of <f> (jc) over this range (i) as a limit, and 
(ii) as a definite integral. 

Extend your definitions to a function </> {Xy y) of two variables x, y 
for the ranges b~a of x and ^~a of y. Hence find the mean value of 

(e^ + e--) (ey + e -y) 

for I, o^y^i, 

59. Four variables x, yy ^ and u are connected by the equations 

looy = I -f 
I02r = 2 + 7y^ 

«= 3 + 5 - 

If you were told to construct a table showing the value of u (but not 
the values of y and z) corresponding to each integral value of x from 1 
to 20, how many values of u would it be necessary for you to compute 
by means of the above equations before you could fill in the remaining 
values by a finite difference method? 

Write down the formula by which you would calculate these remaining 
values. 

60. In an examination the numbers of candidates who obtained 
marks between certain limits were as follows; 


Number of marks 
0-19 
20-39 

40-59 

60-79 

80-99 


Number of candidates 

41 

62 

6 S 

50 

17 


Estimate the number of candidates who obtained fewer than 70 marks. 

61. Explain Sheppard’s rules. 

Use the rules to express in terms of 

(i) «o, Aao, A*k_i, A®«_2, A«m_j, A®«_s, A®k_3; 

(ii) tto, Aw_i, A*m_i, A»«_„ A®«_i, A®m_2, A*«_,. 


21-2 
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62. (i) Explain, giving a simple example in each case, the meanings 
of {a) a probability depending on mutually exclusive events, {b) expecta¬ 
tion and (c) probable value. 

(ii) If two dice are thrown and six coins are tossed, find the probability 
that the difference between the numbers shown by the dice is equal to 
the number of heads shown by the coins. 

63. (i) In a straight line AB of length two points X and Y are 
chosen at random. Using a geometrical method, find the probability 
that XY exceeds a length b (where b<a). 

(ii) If X is first chosen at random in AB and Y is then chosen at 
random in AXy explain carefully with the aid of a diagram why a similar 
method cannot be used and find the probability in this case. 

64. The equation x^ + 2x — 20 — o has a root between 2*4 and 2*5. 
Determine the value of this root correct to four decimal places by a 
method of inverse interpolation. 

65. A large army consists of men between ages 20 and 40, the number 
at age x being proportionate to a + bc^^ where a, b and c are constants. 
If the numbers at ages 20, 30 and 40 are proportionate to 100, 68 and 20 
respectively, find, correct to one decimal place, the average age of the 
men in the army. Given 

•ogio2 = -30i. Iogio3 = ’477. and logic « = •4343- 

66. If three whole numbers are chosen at random, what is the chance 
that their product is a multiple of 2j} 

67. The winner of a game is the one who first scores four points, with 
the proviso that if two players score three points, the game continues 
until one player has scored two points more than the other. A's skill is 
to B's as 2:1. Find A's chance of winning the game if he owes one 
point and B receives a start of one point. 

/A^\ 

68. (i) Prove that f — j (interval of differencing //). 

(ii) If be a function of the form 

bj^x-h b^x^ 4 - b^x^ +.. . to infinity, 
show that it can be expressed in the form 

b^x Ab.x^ A^b.x^ 
i—x ( i - a ;)2 ( i -; c )^ 

69. On March ist a plant 6 inches high was put into a greenhouse. 
On March 8th its height was found to be lo inches, and on March 29th 
14 inches. Estimate its height on April 12th 
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(i) assuming that the height {h) is a rational integral function of 

the time (^), 

(ii) assuming that h and t are connected by a relation of the form 

h^a-\- bV ^; 

and explain why one method should give a better answer than the other 
in this particular case. 

70. The following data are available: 

Age ^ 32 37 42 47 52 57 

35*36 33-25 3072 27-23 23-16 19-11 

It is desired to obtain ^57j with as little labour as possible, and it is 
suggested that 18*71 would be a reasonable approximation. Do you 
agree with this? Give reasons. 

From the above data, obtain a value for 

71. A die is thrown repeatedly until the total of the numbers turned 
up reaches 5 or more. What is the probability that one of these numbers 
is a 6? 

72. If iij. be a function whose differences, when the increment of x 

is unity, are denoted by ••• ^^xy ^^^xy ^^^xy ••• 

when the increment of x is n\ then if ••• geometric 

progression with common ratio show that 

73. Prove that, approximately, 

125W0 = [5]^ («o + ^W-2 - A«i)- 

74. Two men throw for a guinea, equal throws to divide the stake. 
A uses an ordinary die, but B uses a die marked 2, 3, 4, 5, 6, 6. Show 
that B thereby increases his expectation by 5/i8ths. 

75. A bag contains five counters marked i to 5. One counter is 
drawn at a time and replaced. What is the average number of draws 
required in order that counter No. 5 may be drawn twice? 

76. If K9=6, 

«10 = 27 , 

«20 = 62, 

find the value of x satisfying the equation 

Uy, + Wjo+x + ^20+3. = (ttio- X + ^20-a; + 4 ^* 
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77. The goals scored by two teams A and B in a football season were 
as follows: 


Number of goals 
scored in a match 

Number of matches 

A B 

0 

27 17 

I 

9 9 

2 

8 6 

3 

5 5 

4 

4 3 


By calculating the coefficient of variation in each case, find which 
team may be considered the more consistent. 

78. Find ^2:2 from the following table of using all the values 

given: ^ 

- _ 

012 
fo 16-25 13-49 

>>-1 15-55 14-58 13-15 

U 14-48 13-74 

79. During a certain month of the year it is reckoned that if at any 
time the weather is fine the odds are four to one against its breaking up 
within a week. At the beginning of this month, while it is fine, a gardener 
applies to his lawn a weed-killer for the success of which it is essential 
that the weather should remain fine for two days but should break up 
within fifteen days. Find to two places of decimals the chances that the 
weed-killer is successful. 

80. A match consisting of a maximum of five games is played between 
A and B. The chances that any game is won by Ay won by B or drawn 
are equal. Each player scores one point for every game he wins and half 
a point for every game drawn. The match ends as soon as one of the 
players has a sufficient lead to leave him with an excess of points over 
his opponent even if the latter were to win all the remaining games. 
If a large number of such matches were played, what proportion of 
them would you expect on the average to end with the third, fourth 
and fifth game respectively, and what proportion would you expect to 
be drawn? 

81. If Mjc is a rational integral function of the fourth degree in Xy and 

if = = + i =u^ + 2Uj=Oy and is negative, find for 

what values of n greater than 4 

n—1 

6(«-2) S «* = («-!) 
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82. is a rational integral function of the third degree in 

and <f>{x)^f (^) -f-/ {t - x\ show that f (x) dx can be expressed in 

Jo' 

i 

the form S + (o)-f (i) + c<^ (2), where a, h and c are in- 

dependent of and find the values of these constants. 

83. Four people Ay By D and twelve others are to be seated at 
random at a round table. Find the chances that: 

(i) these four people are arranged in two separated pairs, A and 

B together, and C and D together, 

(ii) all four are separated from one another. 

84. Explain the difference, if any, between and A^y, 

Find ^44:61, given 

^40:50= 10*894 %>: 65 = 9*796 « 40:60 = 8*553 

^45:50=10*591 ^45:55 = 9*583 

^50:50= 10*059 

85. A certain type of tag consists of a “bootlace'' with a cylinder at 
one end into which the tag at the other end fits. If any number N of 
exactly similar tags be held in the middle so that the cylinder ends 
hang down at one side and the tag ends at the other: 

(a) what is the chance that if, say, n tags be fitted into n cylinders 
at random, both ends have been chosen from the same “ boot¬ 
lace", so that n loops are formed? 

{b) if all the N tags be fitted into all the N cylinders, what is the 
chance that one large loop is formed? 

86. Three metal discs are numbered i, 2, 4 respectively on one side: 
the other side of each disc is blank. The discs are tossed three times, 
and the numbers showing up are added. A is to win a stake from B 
if the total is 8 to 13 inclusive, while B wins if the total is less than 8 
or more than 13. Find the odds in favour of B's winning. 

87. In a certain town the post office has four telegraph boys who 
take it in turns to deliver telegrams. If a resident in the town receives 
telegrams on n separate occasions, what is the chance that they will be 
delivered by at least three of the telegraph boys ? 

88. A player throws two dice simultaneously and is to continue 
throwing them so long as no figure less than a 5 turns up. If two 6’s 
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turn up simultaneously he is to receive a prize amounting to £12 on the 
first occasion it happens, ^13® on the second occasion, £13® on the third 
occasion and so on. If two 5*s turn up simultaneously he is to receive 
£2 un the first occasion, £2^ on the second occasion, £2^ on the third 
occasion and so on. 

Find the value of his expectation. 

89. If interpolated values are found in the interv^al a; = o to x=i 
from the values w_i, Uq, Wj, Wg, by means of the formula 

«* = 

+ (where ^ = i — x), 

and in the next interval x~i to x~2 by the corresponding formula 
based on the values «2 show that: 

(1) The given values z/q, z/j and zz^ will be reproduced by the inter¬ 

polation. 

(2) The two interpolation curves have the same differential coefficient 

when A? = I. 

(3) The interpolated values for zzj and zz| agree with those given by 

the ordinary third difference interpolation formula based on the 
same values of u^. 

Given the following values: 

w_6 = 1000, z/jo = 2609, 

tto=i 403 , 1/16 = 3487, 

complete the table for unit intervals from Uq to z/jo by the above formulae 
and calculate the value of the differential coefficient of the interpolated 
curves when a; = 5. 

90. A bag contains 10 counters numbered i to 10 respectively. A 
counter is drawn at random and replaced in the bag; this is done three 
times, the numbers on the counters so drawn being added together to 
give a total. The process is repeated as many times as may be necessary 
to obtain three totals each of which does not exceed 12, any series of 
three drawings which produces a total exceeding 12 being ignored. 

Find the chance that these three totals will (if arranged in ascending 
order of magnitude) form an arithmetical progression of common 
difference 2. 
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91. Having given the present value of per annum at the end of 
20 years at the undermentioned rates of interest per cent.: 

Rate per cent. Present value of £ i per annum 

2 16-351433 

2I i5-c8qi62 

3 14-877475 

3 i 14-212403 

4 13-590326 

4J 13-007936 

use Everett’s formula of interpolation to find the present value of 
per annum at the end of 20 years at 3*2 per cent, per annum. 

92. The estimated distribution of cinema admissions in a certain year 
was as follows: 


Price of admission 
in pence 

Number of admissions 
in millions 

2 

13-6 

3 

74-4 

4 

52*2 

6 

272-5 

9 

164-7 

12 

193-2 

15 

53-8 

18 

91-1 

20 

47-8 


Give a sketch of a frequency distribution, showing the median and 
the mode. 

93. If -r is a whole number chosen at random between i and 99 
inclusive, find the chance that ^(loo — ^c) will exceed one-half of its 
maximum possible value. 

94. A floor is ruled with equidistant parallel lines. A rod, shorter 
than the distance between the lines, is thrown at random on the floor. 
Prove that the chance that it falls on one of the lines is 2^/77^, where c 
is the length of the rod and a the distance between the parallels. 

95. If in an examination six men are bracketed, the extreme difference 
between their marks being 6, find the chance that they have all obtained 
different marks. 
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96. The xth term of the series i, 2, 17, 72, 243, 754,is of the form 

+ Determine a, b, c, d and find the sum of n terms of the 

series. 

97. A team of eight men is engaged in a practice shoot. Each man's 
probability of hitting the bull on his target with any one shot is f. 
Each man fires one shot; then those who have not hit the bull fire a 
second shot, and so on, the unsuccessful men at each trial firing again. 

What is the minimum number of trials that should be made in order 
to ensure an even chance of all the men hitting the bull? 

98. In a certain game the player throws a ball on to a board con¬ 
taining twelve holes into one of which the ball is bound to fall. It is 
equally likely to fall into any one of the holes, which are marked 
o, 0, o, o, I, I, I, I, I, 2, 2, 2 respectively. If it falls into a hole marked 
o it is ‘‘lost’*, but if it falls into any of the others it is returned to the 
player, who continues to throw the ball until it is eventually “lost”, 
when the game ceases. At each throw the player scores the number of 
the hole into which the ball falls. 

Show that the probability that the player obtains a total score of n 
is the (n-l-i)th term of a certain recurring series and hence find this 
probability. 

99. A and B both wish to journey from X to Z by tram. A decides 
to take a tram which travels direct from X to Z m 55 minutes, while 
B decides to go by a different route, taking first a tram which travels 
from X to Y in 20 minutes and then another which travels from Y 
to Z in 30 minutes. Trams run from X to Z, from X to Y and from 
F to Z at intervals of 10 minutes, 7 minutes and 7 minutes respec¬ 
tively, but the times of running are not known. If A and B arrive 
at the same moment at X to wait for their respective trams, find the 
chance that A arrives at Z before B. 

100. If = + + 

«;i = W3 + W4-hW6-l-W6 + «7> 

prove that 

«2 + *^3 = (^0 + ^1) ’<^32 
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7, 2®*+^ ~ 2 —5^ (2^4-1) (4^4-1). 8. J low^H-29«®4-low). 
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9 . 

11 . 

12 . 

13 . 

14 . 
16 . 
16 . 

17 . 

20 . 

22 . 

23 . 

24 . 

25 . 

26 . 
28 . 

29 . 

30 . 

33 . 

34 . 
37 . 
39 . 

42 . 

45 . 

46 . 

47 . 

48 . 

49 . 


2w*-f 1+ 47n^ + 6ow. 10. 2^® —2095, 

i (w + 3) (w + 4 ) (« + 5 ) (« + 6)-90. 

Jw (n+ i) (n + 4) (w + 5). 

{( 3 « - 2) (3W + i) (3W 4 - 4) (3w + 7) + 56}. 

(w+i)(n + 2) (3W+13). 

(n+i) (71 + 2) (6w® + 57 ”4-137)- 
\\) {( 27 x 4 - 3 ) ( 2 w-f 5 ) ( 27 x 4 - 7 ) ( 27 x 4 - 9 ) (27x4-ii)-10395}. 

_ n _ 7 x (57x4-13) 7 x (7x4- 1)_ 

4(714-4)’ * 12 (n4-2) (n4-3)‘ * 6 (7x4-3) (n-i- 4)* 

_.”.( 37 x 4 : 5 )_ 21 ” ( 5 ” 4 - ii) 

8 (37x4-1) (37x4-4)* * 4 (”+0 (” 4 ^ 2 )' 

} Jtx (tx4- i) (tx 4- 2) (37X® 4- 367X 4- loi). 

_IQ _ _127x^4-337x419_ 19 

168 6 (3714 i) (37x44) (37x47)’ 168* 

i\t 7 x ( 7 x 4 i) ( 37 x 2 4 3 1 ” 4 - 74 )- 

i (” + 3) (” + 2) ( 7 X 4 l) 7 X; iTX ( 7 i~i) ( 7 X- 2 ) ( 7 X- 3 ). 


3«_ I 


27 . 




a («4 i) 
(«-i)V 


4 C. 


i~3 ( 7 x 4 i)! 4 ( 7 X 4 2)!. 

in {(3”- i) (3” + 2) (3” 4 -5) (3” + 8) (37X 4 11) 488 oj. 

in (67X* 4 167x2 4 97X — 4). 31 . n { yn ^ — 347x2 ^ _ 254). 

i^oTX ( 7 x 4 i) (97x^4 177x44); {2” + ' (tx^ — 37 x ®4 iotx- I 4)}4 28. 

+ 76 T 34-(4«*+12w+I7)/2"->. 


(;v-2)3®. 38 . I (3”-1)4 iV (427X4 1771® 4 TX*). 


(7x4 2) X^ — 2^ 

a;—I (x-i)^' 


_2 Jg®4 36j^- ^4 28 jc4 3 

I2A: (:x :4 i) (a; 4 2) (x 4 3)* 


_1 (39 367x439 1 

54 lio (3”4-2) (3”4-5)J 


43 . 23. 


44 . 7 X^ 2 ”"^^. 


2 — ( tx 4 I) (tx -f- 2) JC” 

I — a: 




C-2“”2 


(^v-_i)! 
(2:^— i)!' 


( 27 X- i) ( 27 x 4 i) (27x43) (2477® 4 54”+ 25). 
iax ^ — ( ia ~ i / j ) x '^ + (ia — ib-h ic ) x ^ + db - ic - hd ) x - i - C . 


(i-xy+\ 



( 7 X 4 l)! _' 

(a 4 i) (a 4 2) ... (a 4 7 x). 


63 . 2 ”“* 7 X ( 7 x 4 i) ( 7 x ®4 57 X —2). 
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Examples 7 * 


2. 

20-796875; -bV- 


10. *02455; -*0003; 0. 

11. 

*109. 

13 . 

15 . 48; 24. 

19 . 

5 ' 254 - 

22. 13*094. 

23 . 5-319. 

24 . 

3 9634. 

26 . 10*389; 10*475. 




Examples 9 . 


2. 

•9921. 

8 . 20*4 miles. 

8 . 

lirT (95W0 - 50 ?A + 600W2 - 350 W 3 H- 425 W 4 )- 


9 . 

200 square inches. 

C' 

10 

00 

t -4 

*8806...; (ii) *8946.... 

12. 

The second. 

13 . 29,426. 14 . 888. 16 . ‘6942. 

22. 

•5236. 

24 . 3-142_ 25 . I 

-1358.... 

26 . 

•6931.... 

27 . 3-142.... 



Examples 10 . 


2. 

i; i. 

4 . 


5 . J. 

6. 

1. 

7 . 

7 

SB ' 

8. 


9 . m 


10 ^ 5 ^’ 
n!n! 

(r- 

11. 



12. 

iWff- 

18 . A. 


14 . iiS. 


16 . 

15 X35 

2 

17 . 

(a) 2"; 

(b) 2”"; 

(^) 

(m-f i) 

2 * 

18 . i8¥S*. 


19 . 

A- 

20. 

iV«5r- 

21. 

1M0 

. 22. 

138 X 56 
-8 

23 . 

iJ. 



(l) IssSoo; (2) iB®(lVoS' 26 . 27 . ii 53 a. 28 . 

29 . }J. 31 . 82 . 83 . ^U- 

84 . im- 35 . ,V 36 . -> 1 /^ 37 . .^4. 


38 . (i) 


/>* (P + 2?) 


(ii) 


39 . 


p^+pq + 

40 . 18 ?. 41 . .jyj. 42 . J?.i. 43 . 

44 . -126. 45 . 2X10-^ 46 . (i) J; (2) (3) 

47 . -105. 49 . '98304 or 09888. 60 . (i) *72; (2) *504. 

51 . (i) *099...; (2) *000007..., *001..,. 52 . *2109. 

53 . (a) *972; (b) *271. 54 . 


56 . 



67 . 7. 


68 . 


n + 3 

2m+8 • 


« 3 . (i) A; ( 2 ) rSil 

65 . 

9^* — 9« + 2 


66 . - 


log. 2 


; log, 2. 


67 . 
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80. 81. (i) (2) (3) ]h 82. r},. 

83. 84. 3000 : 3007. 85. J. 86. 88. 

89. ./j. 90. 81- (i) i'y (2) f; (s) i%- 

82. ,m. 83. W A; (« A; (c) A. 85. ^\V 

86 . 

Examples 11. 


1. 24-9... years. 2. 64-65...; 64-55.... 3. 77 (approx.). 


4. 32*1; 31-6. 5. 33-34. 6. 7-9...; 90.... 

10. II,- 6s. id. 11. (i) Between 5 and 6; (2) 4; (3) 7. 

12. 65. o^/.; 5^. od. 13, 355. o<i. 14. 6 : i. 16. Purse A. 

17. (i) 635.0^.; (ii) 56s. od. 18. ///i- ^8. |7n(»2-fi). 

20. 10. 21. (i) i^.; (2) (3) its. 22. M+Jm. 

23. (i) (2) iflly; (3) ffoViJ Most probable number is 4; 

24. 75. 2^/. 25. £43. 145. I id. ; £56. ss. id. 


Examples 12 . 

1. M=509-6; a = 8-o. 2, M=22-3; Mi—if)'$; ct=9-2. 

3. 9-2 and 6-2. 

35*3^- (jf the extreme values are omitted) or 34-7^. (if they are included). 
6. Mo = 8-6, M, = 7*9, Quartiles 10 and 6. M=y'5. 

6. a = 4-2. 7. ±-15 in. 8. -27 year. 

9. M=6o-7; Mi = sy'3; <7=14-25. 10. ^ is the more consistent. 

11. a= 14-7. 12. X is the more consistent. 


Examples 13. 


2. 

i (n + 2), 


3. 


4. *log, 2. 

5. ^^(^2—I 





n 

77 

6. 

9* 

!>*■ 

7. fa 

2 - 
> 

I 

V3* 

7 T 

(.) 

10. 

i-*. 

11. 

2a 

T 

2 

12. 

,3 8 log 2-3 

64 log 2-15* 

14. 

i. 

16. «}. 


16. i. 

17. i. 

18. 1. 19.^^*. 
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20. S-.Uog,3. 21. 22. i. 24. i + Jlog. 2. 

4 

Jf; miles. 26 . J (2 — Vz) + log (i + V^). 

Miscellaneous Examples. 

1 . ii«4'4). 2 . 

3- 40-3-.*; 37-7---- 5. jjfi-i- 

6 . (a) 19 , 764 ; W 50 , 156 . 11 . (i) U; ( 2 ) A* 


14 . 17 . 977,961; 1023. 18 . 

20. *0294; *0492; *0682. 22. 1-109. 24 . 3845; 9*4- 

26 . Uq + wAwi + in (n— 3) A^i/g 4- Jw (« — 4) (/i — 5) Ahi ^. 


26 . 

AJiy- 


27 . 

3-^ 

16 ■ 

28 . 

— !■ 

00 

00 

29 . 



(Ur l¥: 

i,. 30 . .Ja + la 

loge 

3 - 


34 . 



36 . 

58 - 932 I.... 

36 . 

86; 

I 2 I. 

37 . 

4 


38 . 

20,536. 

39 . 

^4 • 


41 . 

45 - 


43 . 

•7854- 

44 . 

*109... . 

47 . 

I-- 5 -V 

11° 


49 . 

15 - 975 - 

50 . 

i 


51 . 

•66. 


52 . 

f. 

53 . 

180 

65 ~ 

780 
’ “6^ • 

54 . 

113-6, 117- 

0, 120*4, 124*2, 128*8, 134*2, 141*4, 

149 

-4, 158*6, i68*6, 


179*4. 







65 . 

•14 or *18. 


58 . 



60 . 

196. 

62 . 

(ij) iliV- 


63 . 

(i) ia-b)Va\ 


64 . 

2*4695 • 

65 . 

28-0. 


66. 

Vi’ 


67 . 


69 . 

(i) 10 in. 

(ii) 

15*8 in. 

70 . 2908. 




71 . 



75 . 

10. 


76 . 

3 - 

77 . 

B is the more consistent. 78 . 12*50. 




79 . 

•32. 


80 . 

#r, A, Ji- 


81 . 

7 , 13 - 

82. 

a= — 5, b = 

= 4 , c 


83 - (0 iMb 

: (ii) 1? 


84. 

10*476. 


85 . 

(AT-w)! 1 

Nl ’ N' 


86 . 

15 .* 17 * 

87 . 

fi( 4 "- 6.2 

4 

" + 8). 

88 . 15 s. 




89 . 

1403,1498, 

1598,1703,1 

814,1931, 2054, 2182, 23 

16, 2458, 2609; 120-6. 

90. 

m- 


91 . 

14-606063. 


93 . 

a- 

05. 



96 . 

J{2"+* + 3 "+,- 7 «*- 

-M — 

7 }* 


97. 

5- 


98. 



99. 

m- 



INDEX 


Tlie numbers refer to the pages 


Addition rule, 216 
Adjusted differences, 56 
Advancing difference formula, 32, 
50 

Aitken, A. C., 19, 40, 119, i5^» 

165, 169 

Aitken’s notation, 41 
theorem, 158 
Algebraic series, 112 
Alison, S. H., 11 
Approximate integration, 173 
Approximation, successive, 89 
A priori definition of probability, 
208 

Argument, 3 
Array, 254 
Averages, 258 

Backward formula. Gauss’s, 64 
Bernouilli, 270 
Bernouilli’s numbers, 189 
Bessel’s formula, 64, 171 
Binomial curve, 290 
Binomial theorem, 12, 26 
Boole’s Finite Differences^ 19 
Buchanan, J., 139, 150. i 53 . i 97 

Central differences, 61 
Chance, 210 

Change of origin, 32, 178 
Change of scale, 32 
Change of unit, 177 
Class-interval, 256 
Class range, 257 
Coefficient of variation, 287 
Coefficients, detached, 21 
Collins, John, 30 
Compound function, 132 
Compound probability, 218, 221 
Comrie, L. J., 171 
Contact of the first order, 148 
Correlation table, 213 
Cross-means, interpolation by, 165 
linear, 165 
quadratic, 168 

D’Alembert, 207, 270 
De Morgan, 40, 124, 270 


De Morgan’s Differential and Integral 
Calculus, 193 
Dependent events, 221 
Detached coefficients, 21 
Difference table, 3 
Differences, adjusted,, 56 
central, 61 
definition of, 3 
divided, 39 
leading, 4 

Differences of zero, 123 
Dispersion, 279 
Divided differences, 39 
Double integrals, 301 
Duration of play, 240 

Elderton’s Frequency Curves and 
Correlation, 197 

Elimination of third differences, 90 
Entry, 3 
Euler, 52 

Euler-Maclaurin expansion, 188 
Everett’s formula, 66 
second formula, 67 
Expectation, 267 
Expected value, 268 
Extrapolation, 38 

Factorial notation, 19 
Finite differences, i 
Finite integration, 107, 161 
Forward formula. Gauss’s, 63, 65 
Fraser, D. C., 30, 40, 46, 56, 72, 115, 
118, 142, 145, 152, 153 
hexagon diagram, 142 
Freeman’s Actuarial Mathematics, 19, 
40 

Frequency, 255 
Frequency curve, 259 
Frequency distribution, 257 
Function, compound, 132 
rational integral, i 
two variable, 135 

Gauss’s backward formula, 64 
forward formula, 63, 65 
Geometrical solutions, probability, 308 
Gregory, James, 30 
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Hall and Knight’s Higher Algebra, loz 
Hardy, G. F., 119, 184 
Hardy’s formulae, 184 
Henderson, R., 153 
Henry’s Calculus and Probability, 40 
Hexagon diagram, 142, 145 
Histogram, 256 
Huyghens’s problem, 227 

Indefinite finite integral, 107, 163 
Independent events, 218 
Induction, method of, 237 
Integration, approximate, 173 
by parts, i6i 
Interpolation, 26 

by cross-means, 165, 168 
formulae of, 29, 44, 51, 63, 64 
inverse, 81 
osculatory, 147 
Intervals, equal, 26 
subdivision of, 33 
unequal, 39 

Inverse interpolation, 81 

Jenkins, W. A., 153 

Karup, Prof., 150 
Kerrich, J. E., 153 
King, A. E., 197 
King, G., 150, 185 

Lagrange’s formula, 51, 137, 139, 198 
Leading differences, 4 
Leading term, 4 

Lidstone, G. J., 46, 67, 72, 117, 119, 
131, 152, 159, 161, 162, 163, 171 
Linear cross-means, 165 
Lubbock’s formula, 191 

Maclaurin’s series, 160 
Mean, 259, 261 
Mean deviation, 280 
Mean value, 296 
Median, 259, 264 
Milne-Thomson, L. M., 19, 40 
Milne-Thomson’s Calculus of Finite 
Differences, 199 
Mode, 259, 265, 270 
Mortality tables, 229 
Most probable value, 270 
Multiplication rule, 219 
Mutually exclusive events, 216 

Neal, P. G., 150 
Newton, 56 


Newton’s formula, advancing dif¬ 
ferences, 29 
divided differences, 44 
Normal curve of error, 290 

Ogive curve, 265 
Operators, A, E, 6 
33 

A. 41 

8, ii, 76 
106 

A 126 
V, 146 
by A 158 
A, A, 161 

Origin, change of, 32, 178 
Osculatory interpolation, 147 

Parts, summation by, 104 
Poincard, 270 
Polynomial, i 
Prize, 267 
Probability, 204 

application of calculus to, 304 
geometrical solutions, 308 
Probable error, 289 
Probable value, 267 

Quadratic cross-means, 168 
Quadrature formulae, 185 
Quartile deviation, 286 
Quartiles, 286 

Radix, 229 
Range, 278 

Rational integral function, i 
Recurring series, 111 
Remainder terms, divided differences, 
57 

quadrature formulae, 198 
Reversion of series, 81 
Rolle’s theorem, 56 
Root-mean-square deviation, 281 

St Petersburg problem, 270 
Scale, change of, 32 
Semi-inter-quartile range, 286 
Separation of symbols, 16 
Series, algebraic, 112 
recurring, iii 

Sheppard, W. F., 40, 46, 47, 56, 72, 76, 
284 

Sheppard’s notation, 76 
rules, 45' 

Simpson’s rule, 174, i8i 
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spencer, J., 138 
Sprague, T. B., 150, 193 
Spurgeon’s Life Contingencies, 186 
Standard deviation, 282 
Standard error, 289 
Statistical definition of probability, 208 
Statistics, 254 
Stcffensen, J. F., 40 
Steffensen’s Interpolation, 19, 147, 199 
Stirling’s formula, 64 
Subdivision of intervals, 33 
Successive approximation, 89 
Summation, 98 
by parts, 104 
Summation w, 114 
Symbolic notation, 3 
Symbols, A, 3 
E, 6 

8, 33 . 76 

A. 41 
7b 

S, 106 
D, 126 
V, 146 
e, 0,158 

A, A, lA 

Symbols, separation of, 16 


Taylor’s theorem, 55, 126, 141 
Tchebycheff’s theorem, 284 
Term, leading, 4 
Thiele, 26 

Three-eighths rule, 181 
Throw-back device, 171 
Todhunter, R., 67, 119, 141 
Two variables, functions of, 135 

Unit, change of, 177 

Unitary definition of probability, 208 

Universe of an event, 217 

Variate, 254 

Variation, coefficient of, 287 
Weddle’s rule, 182 

Whittaker and Robinson’s Calculus of 
Observations, 72, 197 
Interpolation, 93 

Whitworth’s Choice and Chance, 270 
Wickens, C. H., 197 
Woolhouse’s formula, 193 
notation, 79 

Zero, differences of, 123 
Zig-zag rules, 47 
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